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This terms of use agreement (the "Agreement") governs your 
use of the collection of Web pages and other digital content 
(the "Collections") available through the Internet Archive (the 
"Archive"). When accessing an archived page, you will be 
presented with the terms of use agreement. If you do not 

:e to these terms, please do not use the Archive's 
Collections or its Web site (the "Site"). 

2ss to the Archive's Collections is provided at no cost to 
you and is granted for scholarship and research purposes 
only. The Archive, at its sole discretion, may provide you with 
a password to access certain Collections, provided that you 
complete any required application process and provide 
accurate information in your application. You may use your 
password only to access the Collections in ways consistent 
with this Agreement — no other access to or use of the Site, 
the Collections, or the Archive's services is authorized. You 
agree not to interfere with the work of other users or Archive 
personnel, servers, or resources. Further, you agree not to 
recirculate your password to other people or organizations or 
to copy offsite any part of the Collections without written 
permission. Please report any unauthorized use of your 
password promptly to infoO. archive.org . You acknowledge 
that you have read and understood the Archive's Privacy 
Policy and agree that the Archive may collect, use, and 
distribute information pursuant to that policy. If you provide 
any content to the Archive, you grant the Archive a 
nonexclusive, royalty-free right to use that content. 



Some of the content available through the Archive may be 
governed by local, national, and/or international laws and 
regulations, and your use of such content is solely at your 
own risk. You agree to abide by all applicable laws and 
regulations, including intellectual property laws, in connection 
with your use of the Archive. In particular, you certify that your 
use of any part of the Archive's Collections will be 
noncommercial and will be limited to noninfringing or fair use 
under copyright law. In using the Archive's site, Collections, 
and/or services, you further agree (a) not to violate anyone's 
rights of privacy, (b) not to act in any way that might give rise 
to civil or criminal liability, (c) not to use or attempt to use 
another person's password, (d) not to collect or store 
personal data about anyone, (e) not to infringe. any copyright, 
trademark, patent, or other proprietary rights of any person, 
(f) not to transmit or facilitate the transmission of unsolicited 
email ("spam"), (g) not to harass, threaten, or otherwise 
annoy anyone, and (h) not to act in any way that might be 
harmful to minors, including, without limitation, transmitting or 
facilitating the transmission of child pornography, which is 
prohibited by federal law and may be reported to the 
authorities should it be discovered by the Archive. 



You agree that we may contact you from time to time with 
surveys or other questions regarding your opinions about and 
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uses of the Archive, as well as with information we believe 
may be of interest to you. We encourage you to respond to 
these surveys because we value your input, which will assist 
us in improving the Archive. In addition, we request that, 
according to standard academic practice, if you use the 
Archive's Collections for any research that results in an article, 
a book, or other publication, you list the Archive as a resource 
in your bibliography. 

While we collect publicly available Internet documents, 
sometimes authors and publishers express a desire for their 
documents not to be included in the Collections (by tagging a 
file for robot exclusion or by contacting us or the original 
crawler group). If the author or publisher of some part of the 
Archive does not want his or her work in our Collections, then 
we may remove that portion of the Collections without notice. 

The Archive may immediately terminate this Agreement at its 
sole discretion at any time upon written notice (including via 
email) to you. Upon termination, you agree that the Archive 
may immediately deactivate any password it has issued to you 
and bar you from accessing the Collections or the Site. 

The Archive may modify this Agreement from time to time, 
and your continued use of the Collections and/or the Site 
constitutes your acceptance of any and all modifications. The 
Archive will attempt to notify you of substantial modifications 
via the email address that you have registered with us, if any. 

Because the content of the Collections comes from around the 
world and from many different sectors, the Collections may 
contain information that might be deemed offensive, 
disturbing, pornographic, racist, sexist, bizarre, misleading, 
fraudulent, or otherwise objectionable. The Archive does not 
endorse or sponsor any content in the Collections, nor does it 
guarantee or warrant that the content available in the 
Collections is accurate, complete, noninfringing, or legally 
accessible in your jurisdiction, and you agree that you are 
solely responsible for abiding by all laws and regulations that 
may be applicable to the viewing of the content. In addition, 
the Collections are provided to you on an as-is and as- 
available basis. You agree that your use of the Site and the 
Collections is at your sole risk. You understand and agree that 
the Archive makes no warranty or representation regarding 
the accuracy, currency, completeness, reliability, or 
usefulness of the content in the Collections, that the Site or 
the Collections will meet your requirements, that access to the 
Collections will be uninterrupted, timely, secure, or error free, 
or that defects, if any, will be corrected. We make no warranty 
of any kind, either express or implied. 

You agree to indemnify and hold harmless the Internet 
Archive and its parents, subsidiaries, affiliates, agents, 
officers, directors, and employees from and against any and 
all liability, loss, claims, damages, costs, and/or actions 
(including attorneys' fees) arising from your use of the 
Archive's services, the site, or the Collections. You agree that 
this Agreement is governed by California law and that any suit 
arising from this Agreement will be brought in San Francisco, 
California, and you further agree that on the election and 
reasonable notice of either party any litigation shall be referred 
to arbitration pursuant to the California Code of Civil 
Procedure, §§1280 et seq. In addition, you agree that should 
any provision in the Agreement be found invalid, unlawful, or 
unenforceable, that provision shall not affect the validity or 
enforceability of the remaining provisions. 

Under no circumstances, including, without limitation, 
negligence, shall the Archive or its parents, affiliates, officers, 
employees, or agents be responsible for any indirect, 
incidental, special, or consequential damages arising from or 
in connection with the use of or the inability to use the Site or 
the Collections, or any content contained on the Site or in the 
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Collections, or resulting from unauthorized access to the 
Collections or your transmissions of data, including, without 
limitation, damages for loss of profits, use, data, or other 
intangibles, even if the Archive has been advised of the 
possibility of such damages. Some jurisdictions do not allow 
the limitation or exclusion of liability for incidental or 
consequential damages, so some of the above may not apply 

This Agreement, the Privacy Policy, and other policies posted 
on the Site constitute the full and complete agreement 
between you and the Archive and are not intended to inure to 
third-party beneficiaries. 

We welcome your input. Please contact us with any 
comments or questions at info@archive.orQ . 

Privacy Policy 

10 March 2001 

The Internet Archive (the "Archive") is committed to making its 
constantly growing collection of Web pages and other forms of 
digital content (the "Collections") freely available to 
researchers, historians, scholars, and others ("Researchers") 
for purposes of benefit to the public. The Archive offers 
access to some of its Collections mainly by allowing 
Researchers to access its Unix machines. This open 
approach is somewhat like the situation in a public library, 
where staff and patrons might see who else was in the library 
and a bit of what they were working on. When Researchers 
using the Collections log on to the same Unix machine using 
different accounts, some sharing of information may take 
place. While the Archive endeavors to enforce its Terms of 
Use (http://www.archive.org/terms/index.html) and maintain 
standard computer security, it is important for both those who 
visit the site ("Visitors") and Researchers (collectively, 
"Users") to be aware of the open nature of the Archive. 

The Archive may make changes to this policy from time to 
time and will notify you of such changes by posting an 
updated date in the Terms, Privacy, and Copyright link at the 
bottom the home page of the Archive's Web site (the "Site"). 
Your continued use of the Site and/or the Collections 
constitutes your acceptance of any changes to the Privacy 
Policy concerning, but not limited to, both previously and 
prospectively collected information. 

What Personal Information May the Archive Have on Its 
Computers and Systems? 

Because the Archive uses standard Web logging in its Web 
servers, our Web server may automatically recognize the 
domain name of each Visitor, each Visitor's IP address, what 
Web page the Visitor requests, and the time of the request, 
along with a variety of information supplied by the visitor's 
browser. See www.microsoft.com and www.netscape.com for 
information about the Microsoft Internet Explorer and 
Netscape Navigator browsers, and see www.apache.org for 
details about Web logs. 

In addition, the Archive may collect the email addresses and 
messages of those who communicate with it via email or who 
enter email addresses in forms. 

The Archive may collect personally identifying information 
when a Researcher registers for access to the Collections, 
including the Researcher's name, address, telephone number, 
and email address, and the Researcher's proposal for using 
the Collections. 

The Archive may use "cookies" to track Users' activities on the 
Site and in the Collections. Cookies are small files that a 
server transfers to the hard drive of someone who visits a site 
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and that the server can access when the person returns to the 
site. 

The primary sources of content for the Collections are publicly 
accessible Web pages that were collected and donated by 
third parties, but the Archive will expand on such sources 
through its own collection activities. For instructions on 
removing a particular set of pages currently included in the 
Collections, please see our policies and procedures for page 
removal. 

The communications between you and the Archive may pass 
through many machines, operating systems, programs, 
browsers, Web servers, networks, routers, Ethernet switches, 
Internet service providers, proxy servers, intranets, the public 
phone system, or other devices (collectively, "Devices") on 
your premises, at the Archive, and in between. Some of these 
Devices create logs of activities that are recorded on 
computer systems. 

What Might the Internet Archive Do With the Information on Its 
Computers? 

The Archive has no present intention to charge for access to 
the Collections. The Archive may transfer the information on 
its machines, including personally identifying information, into 
the Collections. The Collections are made available to 
researchers and may be made available on the Site, or 
provided to third parties, for any use, without limitation. For 
instance, parts of the Collections are now in the collections of 
the Library of Congress and the Smithsonian Institution. 

Advances in data mining technology may make it possible to 
discover more personally identifiable information or profiles in 
the Collections. 

The Archive may disclose any information it collects from 
Users if the Archive believes in good faith that such action is 
reasonably necessary to enforce its Terms of Use or other 
policies, to comply with the law, to comply with legal process, 
to operate its systems properly, or to protect the rights or 
property of itself, its Users, or others. 

It is possible that the computers at the Archive could become 
compromised by others and that the information on the 
Archive's computers could be collected and disseminated 
without the knowledge or consent of the Archive. While the 
Archive endeavors to block "crackers" from breaking into its 
machines, the Archive is not responsible or liable for any such 
unauthorized uses of the Archive or its data. 

How to Update Researcher Registration Information 

Researchers can help the Archive maintain the accuracy of 
their information by notifying the Archive of any changes in 
their address, title, phone number, or email address. Contact 
the Archive by email at info(@archive.orq to see, update, or 
delete your information. 

Copyright Policy 

10 March 2001 

The Internet Archive respects the intellectual property rights 
and other proprietary rights of others. The Internet Archive 
may, in appropriate circumstances and at its discretion, 
remove certain content or disable access to content that 
appears to infringe the copyright or other intellectual property 
rights of others. If you believe that your copyright has been 
violated by material available through the Internet Archive, 
please provide the Internet Archive Copyright Agent with the 
following information: 

Identification of the copyrighted work that you claim has been 
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infringed; 

An exact description of where the material about which you 
complain is located within the Internet Archive collections; 
Your address, telephone number, and email address; 
A statement by you that you have a good-faith belief that the 
disputed use is not authorized by the copyright owner, its 
agent, or the law; 

A statement by you, made under penalty of perjury, that the 

above information in your notice is accurate and that you are 

the owner of the copyright interest involved or are authorized 

to act on behalf of that owner; and 

Your electronic or physical signature. 

The Internet Archive Copyright Agent can be reached as 

follows: 



Internet Archive Copyright Agent 

Internet Archive 
Presidio of San Francisco 
P.O. Box 29244 
San Francisco, CA 94129 
Phone: 415-561-6767 
Email: info @archive.org 

For More Information 

If you have any questions or comments regarding these terms 
and policies or the Archive's data collection practices, please 
contact the Archive at info(S).archive.org or Internet Archive, 
PO Box 29244, San Francisco, CA 94129-0244, phone 415- 
561-6767. 
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Frequently Asked Questions 

[ The Wayback Machine | Audio | Texts and Books | Live Music Archive | TJ 
Library Cards (AKA Accounts) | FreeCache | DocuComp | Prelinger Movies | S 
n | Rights [ Equipment [ A 

Questions 



What is the Wayback 
Machine? How can I 
get my site included in 
" e Wayback ' 
Machine? 

I have m y 
site's pa ges excluded 
from the Wayback 
Machine? 

What is the Archive-It 

Archive Wayback 
Machine? 

Can I link to old pages 
on the W.iyhack 
Machine? 

y isn't the site I'r 



Why is 
lookin g tor 
archive? 



The Wayback Machine 

What is the Wayback Machine? How can I get my site included in the Wayback Machine? 

The Internet Archive Wayback Machine is a service that allows people to visit archived versions of Web sites. 
Visitors to the Wayback Machine can type in a URL, select a date range, and then begin surfing on an archived 
version of the Web. Imagine surfing circa 1 999 and looking at all the Y2K hype, or revisiting an older version of your 
favorite Web site. The Internet Archive Wayback Machine can make all of this possible. 

How can I get my site included in the Wayback Machine? 

Alexa Internet has been crawling the web since 1996, which has resulted in a massive archive. 

Method 1 : If you have a web site, and you would like to 
you've searched wayback and found no results, you c; 
Open Directory site and add your site. 



when a site's archive 



Method 2: if you have the Alexa tool bar installed, just visit a site. 

Method 3: While visiting a site using the Internet Explorer browser, use the 'show related links' in Internet Explorer, 
which uses the Alexa service. 

In all cases, ensure that your site's Yobots.txt' rules and in-page META robots directives do not tell crawlers to avoid 



t he creation of the 
Internet Archive 
Wayback Machine? 



Yahoo is closing Geocities. Now what? 



What type of 
machinery is used in 
" 's Internet Archive? 



Yahoo also provides this information: http://help.vahoo.eom/l/us/vahoo/qeocities/close/ 
How can I have my site's pages excluded from the Wayback Machine? 

The Internet Archive is not interested in preserving or offering access to Web sites or other Internet documents of 
persons who do not want their materials in the collection. By placing a simple robots.txt file on your Web server, you 
can exclude your site from being crawled as well as exclude any historical pages from the Wayback Machine. 

le by both academic and non-academic digital repositories 



exclusions. What does 
that mean? 

How can I help the 



What is the Archive-It service of the Internet Archive Wayback Machine? 
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Internet Archive and 



am I getting broken or 
g ray image s on a site? 



What is the Wayback 
Machine's Copyright 
Policy? 



Do you archive email? 



Can I link to old pages on the Wayback Machine? 



Yes! The Wayback Machine is built j 
to reference on your Web page c 
date specification... but that's a bit m 



that it can be used and referenced. If you find an archived page that you would 
in an article, you can copy the URL. You can even use fuzzy URL matching and 
e advanced. 



Why isn't the si 



tn looking for in the archive? 



8 the automated crawlers were unaware of their existence at the time of the 
vere not archived because they were password protected, blocked by 
ur automated systems. Siteowners might have also requested that their sites 
be excluded from the Wayback Machine. When this has occurred, you will see a "blocked site error" message. When a 
" 's excluded because of robots.txt you will see a "robots.txt query exclusion error" message. 



What does it 



n when a site's archive data has been "updated"? 



When our automated systems crawi the web every few months or so, we find that only about 50% of all pages on the 
web have changed from our previous visit. This means that much of the content in our archive is duplicate material. If 
you don't see ""*"" next to an archived document, then the content on the archived page is identical to the previously 
archived copy. 

Who was involved in the creation of the Internet Archive Wayback Machine? 

"The original idea for the Internet Archive Wayback Machine began in 1 996, when the Internet Archive first began 
archiving the web. Now, five years later, with over 100 terabytes and a dozen web crawls completed, the Internet 
Archive has made the Internet Archive Wayback Machine available to the public. The Internet Archive has relied on 
donations of web crawls, technology, and expertise from Alexa Internet and others. The Internet Archive Wayback 
Machine is owned and operated by the Internet Archive." 



Howw 



5 the Wayback Machine made? 



'Who ho-, a cess tc the 



got hacked or 
■ ia ged, could I get a 

kup from the 
Archive?' 

Can people download 
sites from the 
Wayback? 

How do you protect 
my privacy if you 
archive my site? 



How large is the Wayback Machine? 

The Internet Archive Wayback Machine contai 
terabytes per month. This eclipses the amount 

What type of machinery is used in this Internet Archive? 

Much of the Internet Archive is stored on hundreds of slightly modified x86 servers. The computers run on the Linux 
operating system. Each computer has 512Mb of memory and can hold just over 1 Terabyte of data on ATA disks. 
However we are developing a new way of storing our data on a smaller machine. Each machine will store 1 terabyte. 
For more information go to www. petabox.org . 

How do you archive dynamic pages? 

There are many different kinds of dynamic pages, some of which are easily stored in an archive and some of which fall 
apart completely. When a dynamic page renders standard html, the archive works beautifully. When a dynamic page 
' ns forms, JavaScript, or other elements that require interaction with the originating host, the archive will not 
n the original site's functionality. 

are some sites harder to archive than others? 



Why are there no 
WaybackMach J^? 6 



• Robots.txt- We respect robot exclusion headers. 

• Javascript - Javascript elements are often hard to archive, but especially if they generate links without having 
the full name in the page. Plus, if javascript needs to contact the originating server in order to work, it will fail 
when archived. 

• Server side image maps - Like any functionality on the web, if it needs to contact the originating server in order 
to work, it will fail when archived. 

• Unknown sites ~ The archive contains crawls of the Web completed by Alexa Internet. If Alexa doesn't know 
about your site, it won't be archived. Use the Alexa Toolbar (available at www.alexa.com ). and it will know about 
your page. Or you can visit Alexa's Archive Your Site page at 
http://pages.alexa.eom/help/webmasters/index.html#crawLsite . 

• Orphan pages ~ If there are no links to your pages, the robot won't find it (the robots don't enter queries in 
search boxes.) 

As a general rule of thumb, simple html is the easiest to archive. 
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Some sites are not available because of robots.txt or other exclusions. What does that mean? 



The Standard for Robot Exclusion (SRE) is a means by which web site owners can instruct automated systems not 
crawl their sites. Web site owners can specify files or directories that are disallowed from a crawl, and they can evei 
create specific rules for different automated crawlers. All of this information is contained in a file called robots.txt. While 
robots.txt has been adopted as the universal standard for robot exclusion, compliance with robots.txt is strictly 
voluntary. In fact most web sites do not have a robots.txt file, and many web crawlers are not programmed to obey the 
instructions anyway. However, Alexa Internet, the company that crawls the web for the Internet Archive, does respect 
robots.txt instructions, and even does so retroactively. If a web site owner decides he / she prefers not to have a web 
crawler visiting his / her files and sets up robots.txt on the site, the Alexa crawlers will stop visiting those files and will 
make unavailable all files previously gathered from that site. This means that sometimes, while using the Internet 
Archive Wayback Machine, you may find a site that is unavailable due to robots.txt (you will see a "robots.txt query 
exclusion error" message). Sometimes a web site owner will contact us directly and ask us to stop crawling or 
archiving a site, and we endeavor to comply with these requests. When you come accross a "blocked site error" 
message, that means that a siteowner has made such a request and it has been honored. 



Currently there is no way to exclude only a portion of a si 



:o exclude archiving a site for a particular time period 



How can I help the Internet Archive and the Wayback Machine? 

The Internet Archive actively seeks donations of digital materials for preservation. If you have digital materials that may 
be of interest to future generations, please let us know by sending an email to info at archive dot org. The Internet 
Archive is also seeking additional funding to continue this important mission. You can click the donate tab above or 
click here . Thank you for considering us in your charitable giving. 

Can I search the Archive? 



Using the Internet Archive Wayback Machine, it is possible to search for the names of sites contained in the Archive 
(URLs) and to specify date ranges for your search. We hope to implement a full text search engine at some point in 
the future. 



Where is the rest of the archived site? Why am I getting broken or gray images on a site? 

Broken images (when there is a small red "x" where the image should be) occur when the images are not available on 
our servers. Usually this means that we did not archive them. Gray images are the result of robots.txt exclusions. The 
site in question may have blocked robot access to their images directory. 

You can tell if the link you are looking for is in the Wayback Machine by entering the url into the Wayback Machine 
search box at archive.org (http://www.archive.org/web/web.php ). Whatever archives we have are viewable in the 
Wayback Machine. 

The archived webpages are meant to be a "snap shot" of past Internet sites. Please note that while we try to archive 
an entire site, this is not always possible. That is why some images or links might be missing. Additionally some sites 
do not archive well and we cannot fix that. There is a list of common problems that make a site difficult to archive: 
http://www.archive.Org/about/faqs.php#12. 

If you see a box with a red X or a broken image icon that means that we unfortunately do not have the images. Files 
over 10MB are not archived in this "snap shot" of the website. 

The best way to see all the files we have archived of the site is: http://web.archive.org/Vwww.yoursite.com/* 

Please note that there is a 6 - 14 month lag time between the date a site is crawled and the date it appears in the 
Wayback Machine. 

How do I contact the Internet Archive? 



All questions about the Wayback Machine, or other Internet Archive projects, should be addressed to info at archive 
dot org. 

What is the Wayback Machine's Copyright Policy? 

The Internet Archive respects the intellectual property rights and other proprietary rights of others. The Internet Archive 
may, in appropriate circumstances and at its discretion, remove certain content or disable access to content that 
appears to infringe the copyright or other intellectual property rights of others. If you believe that your copyright has 
been violated by material available through the Internet Archive, please provide the Internet Archive Copyright Agent 
with the following information: 
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• Identification of the copyrighted work that you claim has been infringed; 

• An exact description of where the material about which you complain is located within the Internet Archive 
collections; 

• Your address, telephone number, and email address; 

• A statement by you that you have a good-faith belief that the disputed use is not authorized by the copyright 
owner, its agent, or the law; 

• A statement by you, made under penalty of perjury, that the above information in your notice is accurate and 
that you are the owner of the copyright interest involved or are authorized to act on behalf of that owner; 

• Your electronic or physical signature. 

Internet Archive uses the exclusion policy intended for use by both academic and non-academic digital repositories 
and archivists. See our full exclusion policy . 

The Internet Archive Copyright Agent can be reached as follows: 

Internet Archive Copyright Agent 

Internet Archive 

Presidio of San Francisco 

P.O. Box 29244 

San Francisco, CA 94129 

Phone: 415-561-6767 

Email: info at archive dot org 

Why is the Internet Archive collecting sites from the Internet? What makes the information useful? 

Most societies place importance on preserving artifacts of their culture and heritage. Without such artifacts, civilization 
has no memory and no mechanism to learn from its successes and failures. Our culture now produces more and more 
artifacts in digital form. The Archive's mission is to help preserve those artifacts and create an Internet library for 
researchers, historians, and scholars. The Archive collaborates with institutions including the Library of Congress and 
the Smithsonian . 



Do you archive email? Chat? 

No, we do not collect or archive chat systems or personal email messages that have not been posted to Usenet 
bulletin boards or publicly accessible online message boards. 

Do you collect all the sites on the Web? 

No, we collect only publicly accessible Web pages. We do not archive pages that require a password to access, pages 
tagged for "robot exclusion" by their owners, pages that are only accessible when a person types into and sends a 
form, or pages on secure servers. If a site owner properly requests removal of a Web site through 
http://www.archive.org/about/exclude.php . we will exclude that site from the Wayback Machine. 

Is there any personal information in these collections? 

We collect Web pages that are publicly accessible. These may include pages with personal information. 
Who has access to the collections? What about the public? 



Anyone can access our collections through our website archive.org. The web archive can be searched using the 
Wayback Machine . 

The Archive makes the collections available at no cost to researchers, historians, and scholars. At present, it takes 
someone with a certain level of technic al knowledge to access collections in a way other than our website, but there is 
no requirement that a user be affiliated with any particular organization. 

How can I get a copy of the pages on my Web site? If my site got hacked or damaged, could I get a backup 
from the Archive?' 



Our terms of use do not cover backups for the general public. However, you may use the Internet Archive Wayback 
Machine to locate and access archived versions of your web site. We can't guarantee that your site has been or will be 
archived. We can no longer offer the service to pack up sites that have been lost. We recommend using the Warrick 
Tool . Please keep in mind that this is a third party and we can not promise results. 

Can people download sites from the Wayback? 

Our terms of use specify that users of the Wayback Machine are not to copy data from the collection. If there are 
special circumstances that you think the Archive should consider, please contact info at archive dot org. 

How do you protect my privacy if you archive my site? 

The Archive collects Web pages that are publicly available the same ones that you might find as you surfed around the 
Web. We do not archive pages that require a password to access, pages tagged for "robot exclusion" by their owners, 
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pages that are only accessible when a person types into and sends a form, or pages on secure servers. We also 
provide information on removing a site from the collections. Those who use the collections must agree to certain terms 
of use. 

Like a public library, the Archive provides free and open access to its collections to researchers, historians, and 
scholars. Our cultural norms have long promoted access to documents that were, but no longer are, publicly 
accessible. 

Given the rate at which the Internet is changing the average life of a Web page is only 77 days if no effort is made to 
preserve it, it will be entirely and irretrievably lost. Rather than let this moment slip by, we are proceeding with 
documenting the growth and content of the Internet, using libraries as our model. 

If you are interested in these issues, please join and contribute to our announcement and discussion lists . 
What does 'failed connection' and other error messages mean? 

Below is a list of the main error messages you will see while searching the Wayback Machine. If you see an error 
message that does not have the Internet Archive Wayback Machine logo in the upper left corner, you are most likely 
looking at an archived page or the live web. 

Failed Connection: The server that the particular piece of information lives on is down. Generally these clear up within 
two weeks. 

Robots.txt Query Exclusion: A robots.txt is something that a site owner puts on their site that keeps crawlers like our 
own from crawling them. The Internet Archive retroactively respects all robots.txt. 

Blocked Site Error: Site owners, copyright holders and others who fit Internet Archive's exclusion policy have 
requested that the site be excluded from the Wayback Machine. For exclusion criteria, please see our exclusion policy 
(we use the same one used and developed by other digital repositories and archivists both academic and non- 
academic). 

Path Index Error: A path index error message refers to a problem in our database wherein the information requested is 
not available (generally because of a machine or software issue, however each case can be different). We cannot 
always completely fix these errors in a timely manner. 

Not in Archive: Generally this means that the site archived has a redirect on it and the site you are redirected to is not 
in the archive or cannot be found on the live web. 



Why are there no recent archives in the Wayback Machine? 

It generally takes 6 months or more for pages to appear in the Wayback Machine after they are collected, because of 
delays in transferring material to long-term storage and indexing. 

There is no access to files before they appear in the Wayback Machine. 

How does the Wayback Machine behave with Javascript turned off? 

If you have Javascript turned off, images and links will be from the live web, not from our archive of old Web files. 

How did 1 end up on the live version of a site? or I clicked on X date, but now I am on Y date, how is that 
possible? Why can I only see 930 out of the 2000 results? 

How did I end up on the live version of a site? or I clicked on X date, but now I am on Y date, how is that 
possible? 

Not every date for every site archived is 100% complete. When you are surfing an incomplete archived site the 
Wayback Machine will grab the closest available date to the one you are in for the links that are missing. In the event 
that we do not have the link archived at all, the Wayback Machine will look for the link on the live web and grab it if 
available. Pay attention to the date code embedded in the archived url. This is the list of numbers in the middle; it 
translates as yyyymmddhhmmss. For example in this url 

http://web.archive.Org/web/20000229123340/http://www.yahoo.com/ the date the site was crawled was Feb 29, 2000 
at 12:33 and 40 seconds. 



You can see a listing of the dates of the specific URL by replacing the date code with an asterisk (*), ie: 
http://web.archive.org/*/www.yoursite.com 

Whatever archives we have are viewable in the Wayback Machine. Please note that there is a 6 - 14 month lag time 
between the date a site is crawled and the date it appears in the Wayback Machine. 

Why can I only see 930 out of the 2000 results? 

The list of results displayed shows the total number of pages we have for a given domain name. This includes 
numerous repeats as we return to sites to recrawl their content. The reported results is this total; whereas the smaller 
number relates to the number of unique results only. 
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Where does the name come from? 



How do I cite Wayback Machine urls in MLA format? 

This question is a newer one. We asked MLA to help us with how to cite an archived URL in correct format. They did 
say that there is no established format for resources like the Wayback Machine, but it's best to err on the side of more 
information. You should cite the webpage as you would normally, and then give the Wayback Machine information. 
They provided the following example: McDonald, R. C. "Basic Canary Care." _Robirda Online_. 12 Sept. 2004. 18 
Dec. 2006 . Jnternet Archive_. < http://web.archive.Org/web/20041009202820/http://www.robirda.com/cancare.html>. 
They added that if the date that the information was updated is missing, one can use the closest date in the Wayback 
Machine. Then comes the date when the page is retrieved and the original URL. Neither URL should be underlined in 
the bibliography itself. Thanks MLA! 

How can I get pages authenticated from the Wayback Machine? How can use the pages in court? 

The Wayback Machine tool was not designed for legal use. We do have a legal request policy found at our legal page . 
Please read through the entire policy before contacting us with your questions. We do have a standard affidavit as well 
as a FAQ section for lawyers . We would prefer that before you contact us for such services, you see if the other side 
will stipulate instead. We do not have an in-house legal staff, so this service takes away from our normal duties. Once 
you have read through our policy, if you still have questions, please contact us for more information. 

For more information... 

Check out our Wayback Machine Forum 



How can I add a thumbnail image to my item's details page? 

First, make sure you're logged on to archive.org with the same email address you used to upload the item. 
3 in the meta 

To upload the image: 

• Go to your item's details page 

• Click the "Edit item" link in the lower left box 

• Upload the .jpg 

• After a few minutes, return to your item's details page. Click "Edit item" and find the .jpg file you just uploaded in 
the list of files near the bottom of this page. Select the file format JPEG from the drop down menu, and click the 
submit button. 

• Wait 5-20 minutes for your changes to show up. If you're still not seeing your new file, please try clearing your 
cache and viewing the page again, since you may still be looking at an old version of the page. 



How can I get iTunes to ci 



n playlist when I st 



ti MP3s? 



As an iTunes user, you might have noticed that iTunes loads the Archive's streaming MP3s (M3U files) into your 
library, and subsequentiaily the files get shuffled and are out of order. We have come up with a solution to this 

Step by step instructions: 

• Copy the m3uPlayer application to a permanent location 

• Choose some recording in the Archive to stream. This will cause an M3U to download to your default download 
folder (typically your desktop). 

• Click on the downloaded M3U file, hit option-l (or option-click and select Get Info). Change "open with" from 
ITunes to m3uPlayer (locate it wherever you saved it) 

• Click change all so that all future M3U files will open this way 

That's it! If you have trouble, post a message to this forum 

Thanks to http://www.balnav e s.com/archives/000092.php for the code, instructions, and inspiration 
How can I play OGG files on a Mac? 



On the mac, there is a free component to ogg-ify iti 



s. The 



httBi//w ww.macosxhints.com/article.php?story^20020424233612 40Z 



VLC Media Player will also play OGG files. 
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I'm having trouble with a 'blankVcorrupted ZIP file. What do I do? 

There are a variety of problems that may be causing this. Here are a couple of the most common. If you have a Mac 
running OS X, the default unzip utility (Stuffit) does not deal well with those Archive ZIP files that are 'compressed on 
the fly'. You may see an empty directory - if so, then try downloading Zip Tools for Mac OS X and using the drag and 
drop software within that to unzip your download. [Make sure you save your download to your desktop before trying 
things on it.] If you're having any trouble with downloads timing out or being incomplete, especially on Windows, then 
you may be able to use download managers such as GetRight . These will restart your download if it fails. However, 
some 'ZIP on the fly' downloads don't play well with download managers. If you find that to be the case, the safest 
thing to do is to download each track individually in a download manager. 

How can I add a logo to the upper right corner of my Netlabels collection? 

First, make sure you're logged on to archive.org with the same email address you used when you created your 
Netlabels collection. Then: 

• Go to your collection's front page 

• Click the "Edit Item!" link next to the title 

• Upload the logo to the item's directory 

• Return to collection front page and click "edit" link again 

• Find logo file at bottom of page, choose "Collection Header" from the drop down list and click submit. 
It will take a few minutes for the changes to appear. 

How can I get my tracks to show up in the right order? 

The most reliable way to have your tracks appear on the page in the correct order is to name the individual files with 

track numbers, like this: 

01_nameoffirstsong.mp3 

02_nameofsecondsong.mp3 

03_nameofthirdsong.mp3 

(If you have more than 9 files you need to start numbering with 01 - not 1 - otherwise the files will go in this order: 1 , 
10, 11, 12,2, 3 etc.) 

If you have already created an item and you would like to change the file names to rearrange them correctly, do the 
following: 

1. Click the "Edit Item!" link 

2. Rename your original files using track numbers 

3. Delete all "derived" files, leaving only your original files and the .xml files 

4. Click "Edit item" > "Item Manager" and then click the "derive" button 

It will take a little while for the derive to finish running, but once it does you'll have all new files, in the correct order, in 
both the flash player and the page itself. 

What kind of audio file should I submit? 

The archive is all about free access to information, so you should submit file formats that are easily downloadable 
and/or streamable for other site patrons. 

We prefer that you submit the highest quality file that you have available, and then we will attempt to create smaller file 
sizes and formats automatically with our deriver program. We recommend that you do not attempt to do any special 
encoding of your files - the more settings you mess around with, the less likely our deriver code will be able to process 
the file. 

If you are submitting a Live Music Archive item, please only submit Flac or Shorten files. Even for non-LMA items, 
these are the best formats to use. 

Whatever format you choose, please upload each file to your item individually (you can submit multiple files per item), 
in a non-compressed format. Uploading content in a .zip or .rar file makes your item unstreamable and significantly 
less accessible to others. If you upload .zip, .rar, non-audio formats (like .exe), or password-protected files, they may 
be removed by our moderators. 

The table below describes what file formats we will attempt to derive depending on what type of file you submit. 
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This is automatically generated. 
NOTE: inner whitespace is significant. 

Derivatives for Audio Items 





. . . then we will try to derive the following formats: 


source file is 
format: 


64Kbps 
MP3 


64Kbps 
MP 3 ZIP 


128kbps 
M3U 


Flac FlaC ■. 0gg 
| FingerPrint j Vorbis 


VBR 

M3U 


VBR I VBR 
MP3 i ZIP 


24bit Flac 














64Kbps MP3 














96Kbps MP3 















128Kbps 
MP3 















160Kbps 
MP3 














192Kbps 

— 














256Kbps 
MP3 














320Kbps 
MP3 














Advanced 
Coding 














AIFF 














Flac 














Ogg Vorbis 














Real Audio 














Shorten 














VBR MP3 














WAVE 














Windows 
Media Audio 















The flash player is covering my files! How do I move it? 

If an item has little or no description, sometimes the flash player doesn't have enough room in the top portion of the 
page and covers the files below. If you don't want to add a description (which would be nice, so that people know what 
they're listening to), you can add extra space in the description field using paragraph tags. 



http://www.archive.org/about/faqs.php 



11/10/2009 



Internet Archive Frequently Asked Questions 



Page 9 of 41 



» Click the submit button 



For more information... 

Check out our Audio Forum 
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How do I view the PDF 
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print and bind b ooks? 
What is the directory 



structure for .the. texts? 



Howe; 



I make my 



ailable. 



OpenLibrary.org? 
How do you remove 
Gutenberg texts? 
What is the best way 



> link to a book? 



some books fron 
a series, but not all. 
How c an I access the 



For more information... 



Texts and Books 
Reading Books 
How do i download a book? 



What's the best way to read the books without downloading them? 



The text on screen is too small. How do I zoom in? 

Have you tried following the Read Online link? (It's at the left side of the screen.) 



i" link at the left of 



Another solution is to download the pdf of the book. You can then use a program like Acrobat Reader and view the text 
size at 120%, 150%, etc. 

Adobe Acrobat software is free to download and use http jdpb m/down ds/#Reade 



Probably the simplest way to contribute a text item currently is as a pdf. That way, the entire set of images can be 
submitted as a single file, and there are no special naming requirements, beyond ending the filename with ".pdf. If the 
pdf has no hidden text layer (i.e., isn't searchable), then after doing OCR, Archive.org creates a second pdf with a text 

Items can also be submitted as a stack of image files, one image per page. The files can be in JPEG2000, JPG, or 
TIFF format. We plan to provide a more flexible intake procedure, but at present, there are rather strict requirements 
for how the files in an image stack are to be named, and the stack needs to be packed into a single .zip or .tar file 
before submission. 

When Archive.org scans a book for a Contributing Library, we use the custom-engineered "Scribe" workstation, but for 
many materials, adequate images can be made with off-the-shelf scanners or good-quality digital cameras. For best 
results, use the highest resolution your device is capable of. Most images we process were produced at a resolution of 
300-600 ppi. 

How do you do your sponsored scanning for Contributing Libraries? 

The Smithsonian Institution shares this video about the scanning Archive.org does to help make more of their 
Libraries' materials accessible: 

Smithsonian Institution Libraries: Creating the Digital Library (video) 
One Do It Yourself approach can be found here: 

http://www.instructables.com/id/DIY-Hiah-Speed-Book-Scanner-from-Trash-and-Cheap-C/ 
http://www.instructables.com/id/SGP6LHRFTM72YMN/ 



Discussion as development proceeded is 



<s of http://www.archive.oro/details/thelatchkev01 millarch/ 
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You may wish to also consult http://www.archive.Org/about/faas.php #140 
For more on uploading, see 

http://www.archive.Org/about/faqs.php#Uploadinq Content 
How do I report that something's wrong with a book? 

The Internet Archive strives for fidelity with its sponsored scanning for Contributing Libraries. 
If you see an error, we'd appreciate knowing about it! 

Please send an email with the URL (web address) of the book, and description of the problem, to info -at- 
archive.org 

In some cases, you may know of alternate information about a book that is supplemental to the library bibliographic 
record. (For example, a new, more modern transliteration of an author's name.) 

To share additional information like the above, you may wish to post it for everyone to see using the option to write a 
review of a book. 

What is a book identifier? How is it generated? 

For all items at archive.org, the "identifier" is a unique sequence of letters (with numbers also permitted) that is the 
basic unit of identification of an item. It travels with the digital object, and is involved in all ways of accessing or 
otherwise referring to an item. 

You see the identifier is at the end of a URL (web address). 

For this URL: http://www.archive.ora/details/lifeworksofabrah112linc the identifier is "lifeworksofabrahl 12linc". 
For sponsored scanning books, the Internet Archive uses a custom algorithm to generate each book identifier. 
Example: hereismytitleOOauth 

Using this algorithm, up to 16 characters are pulled from the 245 field in the MARC record ( MARC is a library catalog 
record format), and these make up the first part of the identifier. 

Then, whatever volume information the loader indicates shows up immediately after that (for monographs this will 
usually read 00). And then the first 4 letters of the creator are pulled from the MARC 100 field. 

The algorithm also has rules that pull out any articles or punctuation to decrease the chances of duplicating an 
identifier. 

If an duplicate identifier is generated, the person loading the book record at the beginning of the digitization process is 
notified, and manually edits it to make it unique. 

How do I view the PDF books? 

Please see http://www.arc hive.Org/about/faqs.php#62. 

How do I read the books in other formats, like ePub, Mobi, DJVU? 



ePub is an open textual format (not images of pages). Many readers are becoming available. A free one is from 
Adobe . 

Mobi is a proprietary textual format from Amazon supported on the Kindle. 

DJVU is an open format for scanned documents with free readers for windows, mac os-x. linux. It is compact, 
searchable, good looking, and open format. 

What equipment does the Bookmobile use to print and bind books? 

You can find a list of all the hardware and software used in the bookmobile here: 
http://www.archive.org/texts/bookmob ile-inJt.php 

You can also see a movie of a book being made here: http://www.archive.org/details/HowToMakeABookmov 

What is the status of the Internet Bookmobile? 

Internet Archive's Internet Bookmobile is currently out of commission. 

What is the directory structure for the texts? 
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Note re the instructions below: 

• "XXXX" stands for a 4-digit sequence number, starting with 0000. 

• What you're uploading is technically considered "processed" images, not "original" ones, even though they are in 
fact the originals, because archive.org processors wouldn't be doing any rotating or cropping. 

• The zip or tar has to be built from the parent directory, so that the directory name is included as part of the filename 
of each file stored in the zip/tar. 

In order to store all the texts that the archive has, and will eventually acquire, the directory structure is: 



IDENTIFIER/IDENTIFIER.extension (tif, djvu, pdf) 

IDENTIFIER: Unique in Archive's collection, alphanumeric (URL safe), this is the original name adopted by the 
originating collection (alphanumeric characters and _-. Best if from 5 to 80 characters). One format is [title:8-16][vol:2] 
[author:4][scanninglocation:0-4] 

EXTENSIONS: 

• If the original files are tif files, then: 

• IDENTIFIER_orig.tif: All the orginal tiffs are stored in the form of multi page tiff. Demoware windows viewer 
Informatik Image Viewer. If it goes over 2GB, then it is stored as a tar of singlepage tifs the directory named 
IDENTIFIER_orig_tif/IDENTIFIER_orig_XXXX.tif resulting in a file called IDENTIFIER_orig_tif.tar 

• IDENTIFIER.tif: All the cleaned up tifs (usually cropped, despeckled, deskewed) are stored in the form of multi page 
tiffs. If it goes over 2GB, then it is stored as a tar of a directory named ./IDENTIFIER_tif/IDENTIFIER_XXXX.tif 
resulting in a file called IDENTIFIERJif.tar 

• If the original files are JPEG JP2 or CR2 files, then: 

• All the original jpg files are used to make a zip file named IDENTIFIER_origJpg.zip where the names of the pages 
in the zipped directory are IDENTIFIER_origJpg/IDENTIFIER_orig_XXXX.jpg. If the resulting file is greater than 2GB 
(thus breaking the zip format until zip64 is common), then the file will be in tar format named 
IDENTIFIER_orig Jpg.tar . If the originals are jp2 or cr2 files, then substitute these extentions above. 

• Similarly all the processed jpg files (cropped and deskewed) are used to make a zip file named IDENTIFIERJpg.zip 
where the names of the pages in the zipped directory are IDENTIFIERJpg/IDENTIFIER_XXXX.jpg. If the resulting file 
is greater than 2GB (thus breaking the zip format until zip64 is common), then the file will be in tar format named 
IDENTIFIERJpg.tar 

• In the case where there is a small jpg version of the files for on-screen access then a similar naming convention is 
used from the _orig.jpg version above, but with _200KB resulting in a file named IDENTIFIER 200KBJpg.zip where 
the names of the pages in the zipped directory are IDENTIFIER_200KBJpg/IDENTIFIER_200KB_XXXX.jpg. An 
equivalent version can be done with other sizes and different formats such as jp2. 

• IDENTIFIER.djvu: A nifty open scanned book format created by AT&T Labs and enhanced by LizardTech.com 
enabling compression and ease of reprinting. This file will also be ocr'd to make the text searchable. 

( /djvu/bin/documenttodjvu -filelist.txt temp. djvu, /djvu/bin -ocr aatttt.djvu) 

• IDENTIFIER_djvu.xml this is an xml version of the OCR output which has the word positions (as a bounding box), 
this is used for building the djvu file, and is used for searching the flip books, and maybe constructing a searchable pdf 
in the future. 

. IDENTIFIER.pdf: Adobe acrobat format that is derived from the .tif file if present. 

• IDENTIFIER.txt.tar.gz or .art.tar.gz: If there are OCR'ed text files associated with each page, these are tarred and 
gzipped in txt format or art which is sakhr format. 

• IDENTIFIER_cover.doc or .sxw: 

cover of the book, some in legal and some letter, doc is Microsoft Word, and sxw is OpenOffice. 

• IDENTIFIER_xxxx_bookplate.jp2 or .jpg: is the file that has a bookplate that acknowledges those behind creating 
the digital version, xxxx is the page that it will replace in the access formats. 

• IDENTIFIER_meta.xml: This has the catalog data (title, author, publisher, copyright information) and information 
about the book found while scanning (size, who scanned it) stored in a dublincore-like XML format. 

• IDENTIFIER_meta.mrc: This will be the MARC (Machine Readable Cataloging) records for the book which provides 
the mechanism by which computers exchange, use and interpret bibliographic information and its data elements make 
up the foundation of most library catalogs used today. 

• IDENTIFIERjmarc.xml: marcxml format of marc record 

• IDENTIF!ER_metasource.xml: where the metadata information came from (metadata about the metadata :) ). 
LEGACY FORMATS: This could be OTIFF | PTIFF | TXT. 

• OTIFF: These are the original tiff images of the scans of the books, (to create multipage tifs we used a unix util: 
tiffcp OTIFF/*.tif aaattt_orig.tif) 

• PTIFF: These are processed images (cropped, desqewed.depeckled) from the originaltiffs. 

• TXT: These are the text files that have been created by doing Optical Character Recoginiton (OCR) on the tiff 
images. 

• We plan to eventually remove OTIFF|PTIFF|TXT directories. 

What is OpenLibrary? How can I make my book available via OpenLibrary.org? 

The Open Library is a project of the Internet Archive (archive.org), a non-profit organization in San Francisco, guided 
by the goal of universal access to human knowledge. Our small team is working to create a web page for every book 
ever published, at openlibrary.org. 

Some facts about Open Library you might like to know: 
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• You are free to edit/correct any errors or omissions you see on openlibrary.org - it's an open, editable wiki. (Just 
look for the "EDIT" button.) 

• We serve a catalog some 23 million books, but not the books themselves. 

• We don't buy or sell books 

• We have no way of putting you in touch with authors or publishers 

• Our team isn't able to help you do research on titles you find in Open Library 

There is more information on the Open Library site itself: 

About OpenLibrary.org 
http://op e nlibrary.org/about 

Frequently Asked Questions 
http://openlibrary.org/about/faq 

Developer Center 
http://openlibrary.org/about/tech 

Many authors write in to ask how they can make their book available as a free download via OpenLibrary.org. 
Here's one option: 

Since OpenLibrary.org is a user-editable project, you can sign in to OpenLibrary.org to create a page for your book. 
You can upload the book to Archive.org (see information above), and link to the copy you upload to Archive.org. 

You have the option of choosing a particular Creative Commons license for your work, or making a custom statement 
on what specifically people can or can't do with your item. Remember that if you wish people to contact you regarding 
use permissions, you'll need to provide contact information, such as a mailing address or website. Some uploaders 
choose to include this information in the description field. 

How do you remove line breaks from the Gutenberg texts? 

In Word use find and replace 3 times: 
Step 1 . Find two paragraph markers - A p A p 
Replace with a neutral character ~ or # or @ 
Step 2. Find one para markers - A p 
Replace with a single space 
(This might take about 1 0-1 5 minutes on large files) 
Step 3. Put 2 para markers back in - find - 
Replace A p A p 

What is the best way to link to a book? 

Every book in the Archive has an identifier. For example, RomeoAndJuliet. To link to the book, you should use the 
following URL: 

http://www.archive.org/download/RomeoAndJuliet 
Can I volunteer for the book project? 

Volunteers are welcome to come to our San Francisco location during business hours and help make books. These 
books are given out as calling cards and thank you gifts to help raise awareness to the Internet Archive. Please write 
to info at archive dot org for more information or to make an appointment. 

I see some books from a series, but not all. How can I access the rest? 

Many contributing libraries work with the Internet Archive to scan and provide online access to books. 

To ask about whether there are plans to include additional volumes, or other particular books, you can contact the 
Contributing Library. 

You may wish to also consult http://www.archive.Org/about/faqs.php#195 and http://openlibrary.org/b pl 
For more information... 
Check out our Text Forum 
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Live Music Archive 
A recording I uploaded and marked 'no lossy formats' 



had them created (mp3, ogg, m3u, etc..) . How can I 



If you come across this situation and you are the uploader, click [edit] and then 'Update'. You should see the message 
"Format Options Updated Successfully". Within 10 minutes the system will create a "_rules.conf file in the recording's 
folder. Then, the next time the system performs an automatic sweep looking for changes, it will notice the new rules 
file and remove the lossy files automatically. The sweep occurs approximately twice a day, so you should see the files 
removed within 12-24 hours. 



What is the Live Music Archive all about? 

This audio archive is an online public library of live recordings available for royalty-free, no-cost public downloads. We 
only host material by trade-friendly artists: those who like the idea of noncommercial distribution of some or all of their 
live material. Live recordings are a part of our culture and might be lost in 100 years if they're not archived. We think 
music matters and want to preserve it for future generations. 

The LMA draws strength from the members of etree.org and other online communities of music fans devoted to 
providing public access to high-quality digital recordings of tradable performances. Typically, recordings are made by 
the fans themselves. Recordings are preserved in "Lossless" archival compression formats such as Shorten or FLAC 
(MP3 is not Lossless) for highest quality preservation. 



What are MD5 files? 

MD5 files contain checksums, strings of characters used to uniquely represent a file. These checksums enable users 
to verify that music files downloaded correctly. 

A recommended tool for creating these files is 

this tool you should open the MD5 in a text editor and 
top of the file. 

What are FLAC files and how can I listen to them? 

FLAC stands for free lossless audio codec. It is an open source, lossless compression algorithm for digital music. It 
compresses music files to 50-60% of their original size, with no loss in quality. More FLAC information can be found oi 
the FLAC sourceforqe site and in this etree FAQ . 



To listen to FLAC files: 

ti-format audio player, and then install the FLAC Plugin for 

Windows: Download and install WinAmp . a multi-format audio player, and then install the FLAC Pluoin for WinAmp. If 
you would like to use FLAC with your Windows Media Player (WMP) download and install the Directshow Filters for 
Ogg Vorbis. Speex. Theora and FLAC . This will allow WMP to not only play .flac files but .ogg files as well. 

3: Download and copy "libxmms-flac.so" to your XMMS media player input 



What are FFP files? 



Why; 



e there no shows by band X? 



We'd like to make sure that a trade-friendly band would not mind having their shows in the Archive for public 
download. The best way for us to find out is by getting permission from a band representative or by the band's having 
an explicit policy that covers this type of site. If there are no shows by the band, either we don't have enough of this 
information to go forward with archiving, they have declined participation, or we are ready to accept shows but no one 
has uploaded anything yet. (Also, see the band status FAQ ). 
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sr will otherwise trade-friendly bands who have declined to 



get disconnected 
before the download 
s. What c " 



Bands, see other relevant FAQs here and here . Patrons, s 



•e about how you can help here. 



t are the WAV 

etimes in filesets? 

I just uploaded a 
directoryjhat 
contained WAV MD5 
checksums, is that 



file failed. What can I 



There has been an increasing number of shows uploaded to the Live Music collection without setlist information, or the 
setlist was not properly matched to the files. When you notice a recording like this, please submit an error report only 
if you have an updated setlist, or you are able to match the files up correctly. 

We would prefer that you do not submit error reports letting us know that there is no setlist - tracking down setlists for 
every concert and matching them up to the recordings is a monumental task that has grown beyond the capabilities of 
the small group of Archive.org admins. We would like fans that are familiar with each artist's material to help us with 
this project - in your error report, please give us specific instructions on what changes to make and we will do so. 

How do I bum FLAC files to CD as audio tracks? 

will first need to convert the FLAC files to another format that your burning program is familiar with. Windows 
-s can use the FLAC Frontend . to convert FLAC files to WAV files, which are suitable for burning programs. For 
Macintosh OS X users, Dan Greuel has created a tool called MacFLAC. 

How do I bum SHN files to CD as audio tracks? 



Macintosh: Download and install 



tool, appropriately titled, Shorten for Macintosh. 



Where have all the 
Dave Matthews Band 
conce rts gone? Will 
they be back? 

Why_js there no 
Phish? What about 
Wides pread Panic? 



download manag er 
and now it stop ped 
working. What's the 
deal? 



Linux or any other UNIX-based architecture: Download and install sj 
What is the status of band X for the Archive? 

5/2006. significant s 



w-system presentation of info. We have 3 categories: 

May he Ar chived- Band sections have been activated by Archive admins. Shows can be hosted here to the extent 
permitted by the band. Click on the band name and then through to their Policy Notes link to see what limits they may 
have placed on taping, trading or archiving. 

Pending - When a patron sends us information about having contacted an additional trade-friendly band, the new band 
is considered to be "Pending". Admins will update notes we keep on the band based on the information that people 
send to etree at archive dot org. (Sensitive parts of the info- such as email addresses used- will not be posted in the 



Do you provide an 
" S feed of new 
Jates to the LMA? 



Important: Under the new system, we cannot create a "collection page" for the band name unless and until we know 
at the band May Be Archived. Further, no shows may be uploaded for any band in advance of a band section's 

activation. Under the new system, there is no temporary "upload area" to store filesets for bands whose sections are 
3t prepared yet. Please send shows for bands on the active list only. 



Why don't I get an 
email when my 
uploads fail MD5 
checksums? 



My in-proqress upload 



If your favorite band name is not in any of these 3 categories, there are several possible reasons: They may not be 
trade-friendly in the first place. No one may have contacted them yet. Someone who contacted them may not have 
informed us yet. The band may not have written us back yet. If a band did write to us, we may not have had a chance 
to activate a section yet, or we may not have received enough information back from them to setup their section. In 
le cases, we may not have received the email successfully, so that a resend may be necessary. 

Bands, see other relevant FAQs here and here . Patrons, see more about how you can help here . 

an artist who would like to be included in the Archive, what do I need to do? 

d love to have you! Just write to us at etree at archive dot org in English giving some kind of permission for us to 
archive your shows for public download and noncommercial, royalty-free circulation. It does not need to be a formally 
worded declaration, and can come from anyone you feel has the "say-so." We just need to be clear on how you feel 
about the project. We will put relevant quotes onto a new "collection" page ( examples ) for your performances, along 
with a link to your official website. 



It is necessary for you to email us at etree at archive dot org in order to 



a new section. We want to be si 
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Can I upload live 
recordings that were 
broadcast on XM 
10 or Sirius 



The Grateful Dead is 
here, when will we see 
Jerry Garcia 
recordings? 

Regarding removing 



checked the box to 
remove them and 
clicked update. Now 



■FLAC Fingerprint' file 
with my recording - 
how can I create this? 



about to upload to the 
collection - should I 
include it? 

Where can I find other 
Lecordirigs_b y [trade- 
frjendly band] that 
aren't in the 



I tried downloadi ng a 
show and J got a '403 
Forbidden' page- 
Why? 



What file formats are 
acce p ted for 

Live Music Archive? 

I like adding concerts. 
Do you have a 
preference on the wa „y 
I put in information? 



What are the optio 



What are the optioi 



Where can I see the 
rest of the 'Most 
Downloade d Items' in 



the go-ahead really is coming from you. Please do not attempt to create your own collection, or to upload any of the 
band's shows, in advance of receiving an emailed confirmation message from curators; such attempts may 
significantly complicate or delay the curators' setup process. 

You can give as much or as little scope for archiving as you like. Some bands place limits on what can be hosted, and 
we can accomodate those. Archive Curators, volunteer fans who have proven to be in line with the spirit of this 
archive, will attempt to screen contributions for OK'ed material only. 

At the same time you give the go-ahead, feel free to pass along any notes or policy links on your general 
taping/trading stance as well. You don't need to have a formal written or posted policy before inclusion, but we'd like to 
know how you feel about the topic. 

Besides fans' sending their copies of your shows, you can also prepare and upload your own live recordings to the 
Archive, if you like. In fact, if you'd like to limit your material to selected contributions from you only, please just let us 

If you have any questions about the project, please ask us anytime at etree at archive dot org. 
Can I upload concert videos? 

At this time, video uploads are not being accepted, namely because most of the bands archived prohibit the video 
taping of their shows. Moreover, unlike audio, where we actually have a shot at archiving the vast majority of any given 
band's live concerts (in very high quality format), video is scarce and, unless made by the artist (in which case, it's 
typically for commercial purposes), is not of particularly good quality. 

The progress of my upload says 'File metadata XML invalid. Waiting for user to correct.' How can I fix this? 

This is typically caused by illegal symbols being used somewhere in the information that was put into one of the forms 
submitted with the show (either the import form or "File Options"). Double check that the only characters being used 
are those visible on a standard English-language 104 key keyboard. More information and a few examples are here. 

If you have trouble finding the cause, please post to the forum for help. An admin will have to resubmit the recording 
for another try, so please send an email including a link to the recording to etree AT archive DOT org if you believe you 
have cleared the issue. 

More information on what XML files are and how they are created can be read here. 
I have more Live Music Archive questions. ..who do I ask? 

Feel free to email etree at archive dot org with any questions, and we'll do our best to post the answers here as soon 
as possible. Also, the message board is a great resource; with so many kind, knowledgable folks out there, you can 
often get a speedy answer to your question. 



I have a different si 



e for a show that is already in the archive, should I upload it anyway? 



Yes! In keeping with the nature of this Archive, it is appropriate for multiple sources of the same show to be available 
for download. When you upload the new source, be sure to name the source in the show's top level folder to avoid 
confusion. Some bands do place limits on the types of sources allowed (such as soundboard recordings), so please 
check the policy for any given band. 



How can I help get bands into the Live Music Archive? 

If you know of a trade-friendly live-performing band that is a good candidate for the Archive, you can initiate contact. 
Some tips and letter templates can be found here . When you write, make it clear you are asking about the Live Music 
Archive at archive.org. Don't just ask about their general taping/trading stance. We want bands to know what's up. 

Next, follow up with a message to etree at archive dot org. Mention when you tried to contact the band and what 
contact point you used. These are important in order to update our contact records. Admins will update the contact 
status in an announcement forum about Pendina Bands based on the message you send us. 

If you receive a reply from the band, positive or negative , send a complete copy of the email, complete with its 
sender's address/brief header info, to etree at archive dot org. It's a good idea to send a copy of what you asked them 
as well (if not quoted in the reply), since it will give context to the answer. We need to have full info in hand in order to 
set up the band appropriately in the Archive, and we may need to contact them for followup questions. 

If you are hesitant to make contact yourself, you can mention the band to Archive admins (send email to etree at 
archive dot org) and they can try a contact as time permits. To help out, supply any contact or policy info you may 
already know about the band. 



Most web browsers now support robust http downloading. For questions, see the support website for your browser. 
What are the WAV MD5 files that are sometimes in filesets? 
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MD5 checksums files are not exclusive to SHN files. An MD5 checksum can be used to ensure the accuracy of any 
data file (e.g. .doc, .mp3, .mpeg). Some seeders produce MD5 checksums for their WAV files, as well as for their SHN 
files. This is just an extra level of confirm to ensure exact copies of the original WAV files are being burned from the 
SHN files. Checking a WAV file with a MD5 cheksum is no different than checking a SHN file. If you use mkwACT, you 
can just right click on the wav MD5 and choose "verify." 

I just uploaded a directory that contained WAV MD5 checksums, is that OK? 



The WAV MD5 checksums are ignored by our robot and will not cause problems for your recording. 
My failure email is indicating that the text file failed. What can I do? 

Unlike FLAC or SHN, text files do not translate identically from 1 platform to another. Since the archive.org servers run 
Unix, text files created on other Operating Systems will fail their MD5check. We recommend uploaders remove any 
text files from their MD5's if they are having this problem. 

Can bands place restrictions on material to be archived? 



Yes. Each band can tailor the extent of their permission to the Archive. We quote the band's wishes in the Rights 
section of the band's Collection page . Here are some examp les of special restrictions bands have requested. We point 
out different cases in a band's policy information using a shorthand " Limited Flag " tag. 

We have a contribution system set up to accomodate individual bands' requirements. During the upload process, 
contributors are urged to double check the band's policy notes at different stages. Archive Curators, volunteer fans 
who have proven to be in line with the spirit of this archive, will attempt to screen contributions for OK'ed material only. 
In addition, access to a particular item can be removed if it becomes restricted later (for example, a date newly chosen 
for commercial release must be removed under some band's policies). 

Bands, please contact us at etree at archive dot org anytime to let us know how we can work with you to make things 
happen. 

I just uploaded a show and all the files fail the MD5 check, whafs the deal? 

Please be sure that if you are choosing any upload format, you are uploading the files in "binary" mode. If you try to 
upload .shn or .flac files in "ASCII" mode the files will fail the MD5 check. ASCII is the standard format for encoding 
plain text files (actually a subset of binary), while binary is used to encode almost all other types of files. More 
information on binary vs. ASCII can be found here . 

If this does not solve the problem, be sure that all the file names in the MD5 file match the .shn file names. Be aware 
that the UNIX system the Internet Archive runs on is case-sensitive. 

If you upload FLAC filesets to the LMA, please follow the naming standards to help the checking program here. 
Directories should be named with .flac16 or .flac24 suffix, not .flac. Otherwise, the program will report failures. 

Where have all the Dave Matthews Band concerts gone? Will they be back? 

At the request of the band's management and as a result of the band's 2003 policy change, Dave Matthews Band 
concerts (as well as Dave Matthews solo concerts and Dave and Tim shows) have been removed from the Internet 
Archive. We're very sorry about this unfortunate turn of events but feel like it is important to honor the wishes of the 
band and its management. 

For more information and discussion see this post: 
http://www.archive.org/iathreads/post-view. php?id=3670 

Why is there no Phish? What about Widespread Panic? 

Phish has decided not to participate in the Archive at this point in time. Their official response can be viewed here . 

Similarly, Widespread Panic has opted out of the project for the time being. They were last contacted on 11/9/2004. 
Their response can be seen here. 

I used to use a download manager and now it stopped working. What's the deal? 

Download managers increase your download speed by connecting to the server multiple times. Doing this does not 
significantly increase download speeds but dramatically hurts the performance of the server. If you wish to use queue 
to download from the HTTP servers, be sure you set your download program to only use one connection at a time. 

What's the deal with magic number errors? 

If you get a magic number error when listening to or decoding a SHN file, the SHN file is most likely corrupt. First, 
make sure the SHN file passes MD5 verification; if it does not, redownload the file. If the file passes MD5 verification 
and you are still getting the magic number error, leave am error report via the show details page noting the magic 
number error and which track the error occurs on. Hopefully others who have download the show will confirm or deny 
the error. If the error occurs for all downloaders, the seeder will be contacted to provide a new, uncorrupted track. 
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Please note that there is nothing the Internet Archive administrators can do about a magic number error, becuase the 
only solution to the error is re-encoding the SHN file from the original WAV file. 

Do you provide an RSS feed of new updates to the LMA? 

Indeed! The URL of the feed is http://www.archive.org/services/collection-rss. php?mediatype=etree&collection=etree 
You can plug this into a front end like AmphetaDesk (available at: http://www.am phetadesk.com) 

What does the 'Transferred by' field mean? 

This field indicates the person who did the original DAT/MD/Cassette to WAV conversion. Also, note that in the case of 
recordings made directly to laptops there is no transfer. 

Why don't I get an email when my uploads fail MD5 checksums? 

The system currently only sends emails when MD5 files are included. This means that, if you're uploading FLAC files, 
you still need to generate and include an MD5 file if you want to receive informational emails about the failures. 

A recommended tool for creating these files is MD5summer . Please note that before uploading the MD5 created with 
this tool you should open the MD5 in a text editor and remove the top 3 lines so the first signature is now flush with the 
top of the file. 

Can I log into an FTP server to download concerts? 

Update (2009April): To allow us more flexibility on access, we are discontinuing FTP read access. HTTP read access 
(as in downloading through your web browser), remains more popular with users, and shall continue. 

For more information, please see the discussion forum: 
http://www.ar chive.org/iathreads/post-view.php?id=240921 

My in-progress upload says ' No metadata describing files found. Waiting for user to enter metadata' - what do 
I do? 

There are 2 XML files that get created during the import of any recording in the collection: 

showfolderjmeta.xml 
showfolder_files.xml 

The first file gets created when you submit the import form to the collection. If that file does not exist, you can create it 
by editing the details page and clicking Update. 

The second file gets created by filling out File Options. Just click the link on the left side of the details page and fill out 
the form as accurately as you can. 

If either of these files are missing, your Contribution may give you this message. Please note that once the files get 
created, it takes 5-10 minutes before the system notices them and moves on to the next stage. 

Can I upload live recordings that were broadcast on XM Radio or Sirius Satellite Radio? 

At this point in time, Archive.org cannot host recordings that were broadcast over either of these services. Subscribers 
have informed us that they were required to sign a "Terms of Use" document that forbids the 
recording/hosting/rebroadcasting of any material received from these services. Until we hear otherwise, these 
recordings cannot be hosted here. 

The Grateful Dead is here, when will we see Jerry Garcia recordings? 

The taping policy of the Grateful Dead does not extend to recordings of Jerry Garcia's other lineups. Jerry's solo work 
is controlled by his estate. Representatives have said No to the idea of hosting shows in the Live Music Archive. 

Regarding removing the lossy files ... I edited my show, checked the box to remove them and clicked update. 
Now when I click update again, the box is still not checked. Why? 

It takes 2-10 minutes for your checking of that box to 'stick' ... see this discussion board post: 
http://www.archive.orq/iathreads/post-view.php?id=22816 for an explanation of why. 

The upload instructions require a 'FLAC Fingerprint' file with my recording - how can I create this? 

In Windows: 

1 . Open FLAC Frontend 

2. Drag all of the FLAC files of your recording into Flac Frontend window, (you can also use the "add" button to do this) 

3. Click the "Fingerprint" button. 

4. Save the fingerprint file with a name like this: bandYYYY-MM-DD.ffp 
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I've got a great 'filler' for the recording I am about to upload to the collection - should I include it? 

A 'filler' is music from a different performance in addition to the main recording, typically used to fill up extra space on a 
CD. Sometimes the filler is a different artist, other times it is the same artist, but a different show and date. 

While this is convenient for burning full CD's, it is not appropriate to include fillers on recordings here in the collection 
since they get filed under the artist and date of main performance. Please only include the performance for the artist 
and date you are importing. Fillers should be filed under their own entries elsewhere in the collection. 

Where can I find other recordings by [trade-friendly band] that aren't in the collection? 

If the artist is OK with Internet trading, you may be able to find downloadable recordings through htt p://bt.etree.org or 
http://www.furthurnet.net . Also, check http://db.etree.org to find people who have copies of shows and who may be 
willing to trade. Etree.org has additional trading forums at http://forums.etree.org Lastly, you can check out a band's 
own fan forums and mailing lists. Good luck! 

In contrast, the Live Music Archive forum at the Internet Archive is nor a good place to post about trades, or to ask for 
shows that are not yet archived here, whether or not the band presently has a section here. Moderators may delete 
these posts. More posting etiquette tips for that forum are here . 

I tried downloading a show and I got a '403 Forbidden' page. Why? 

As part of the new (as of May 2007) QA/QC checks that the archive conducts on shows that are uploaded, more 
refined checks are conducted on shows. For more detail, see this forum post: http://www.archive.org/iathreads/post- 
view.php?id=1 24098 What happens though, when a show either fails it's md5 check, it's internal flac checksum check, 
or is missing an info.txt file, every non .xml file in the show fileset (the flac files, the mp3's, etc) all become non- 
downloadable. If you try and click any of the music files, you will be taken to a webpage titled "403 Forbidden" that will 
say: "Forbidden You don't have permission to access "ARCHIVE. ORG_Server/show_location/file" (specific to your 
show file) on this server. **** What this means is that the uploader has a problem with their show files, and as a 
measure to 'stop the spread' of bad files, the system is preventing people from downloading until the uploader contacts 
the archive to fix the show. If you as a user find a show that has the above problem, please check back later and once 
the uploader has fixed the problem, the show will be downloadable as normal. 

How do I upload a show to the LMA? 

As of 5/2006. the upload method has changed significantly. Here is a walkthrough in PDF with screenshots. Another 
texldescri ption is here . 

Before uploading any show , read the band's policy notes for this site . Many artists place .limitations on their material 
here, and info is often updated. Please do not upload shows for any band that does not yet have a curatoweated 
collection page here, even if you know the band has recently emailed their permission. Advance attempts may 
ign ' intly complicate or delay the curators' setup process for the band. 

Next, be sure that you are logged in as an Internet Archive member. Have the fileset on your computer already, 
correctly prepared and correctly named . Files must be in lossless format (.flac or .shn), from lossless parent source 
material; we will optionally create the extra "lossy derivative" copies (.mp3, .ogg) onsite. Prepare to create an item , 
following example tips here or here . 

How do I make corrections to shows? 

Sometimes people make typos or other mistakes on uploads, or leave gaps in info that can be filled in later. You can 
help supply good information for archived items. Here is the current best method to submit corrections: 

If you uploaded the show, you can make the changes to the details page yourself. Make sure you are logged in as the 
user who uploaded the show and go to the details page of the show you are trying edit. Click on the "edit" link next to 
the band name at the top of the details page and you will be able to edit the show details including venue, location, 
source, setlist, etc. Be aware that editing these fields will only change the show details, not the files themselves. 

5/2006 update : If you uploaded the item and would like to replace or add to files within your item, under the current 
system this can be done without reuploading the entire fileset. More description may follow; meanwhile there is a 
walkthrough as a W ord document with screenshots . Specifically to fix your items derived between 5/1 1 -22/2006 that 
sound too fast in the onsite flash player (chipmunk problem), see this PDF document with screenshots . 

If you did not upload the show, please click the 'Report Error' button and state concisely and precisely what the 
problem with that particular show is (If the problem is a missing setlist, please see this FAQ ). If there are one or more 
missing or broken files that you can provide, please re-upload and re-import the entire show under a new directory 
name, and then hit 'Report Error' for the old, broken show, asking for that show to be removed. 



What file formats are accepted for contributions to the Live Music Archive? 

Currently, the Live Music Archive will only accept audio files in either of two lossless formats: FLAC (.flac) or Shorten 
(.shn). Please Note that MKW files (.mkw) are *NOT* an acceptable file format for your contributions because they 
lack cross-platform compatibility (Mac users are unable to play or decode MKW files) 

In addition, please do not upload the lossy files (MP3 or OGG) next to your FLAC or SHN format files - the Archive 
creates those files automatically, provided that the contributor agrees to having them available. This ensures that all 
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the files here have uniform quality options selected. 

Please follow etree.org's Seeding Guidelines when preparing your contributions for addition to the collection. Pay 
particular attention to the Naming Standards section. A well-named identifier helps patrons find your show in our large 
collection. A well-named set of files allows files to be listed in the proper order at the site, and allows patrons to listen 
to them in playlists and burn them to CD in the proper order, too. 

I like adding concerts. Do you have a preference on the way I put in information? 

First of all - thank you so much for contributing to the Archive. Yes, here are some guidelines that will help us maintain 
good records for each concert. 

• Do not include HTML in the source and lineage fields. 

• Do not repeat information in the notes fields (such as source information, or number of discs). Only include 
information in the notes fields that is not already in any other field. 

• If at all possible, keep absolutely nothing but song names in the setlist (even things like disc splits, set splits, 
etc. should not be in this field). If possible, putting all song names on one line, separated by commas is 
wonderful. 

• Do not fill in unknown field with questions marks or N/A - just leave them blank. The exception to this guideline 
is the venue, setlist and source fields (which are mandatory) - in the event that this information is not known, 
simply write "unknown". 

Once again, thank you so much! 

About Grateful Dead concerts on the Archive 

Audience-made Grateful Dead concert recordings are available as downloads while available soundboards are 
accessible in streaming format only. 

The Grateful Dead is being separated from the Live Music Archive into its own collection (with its own forum) to avoid 
confusion about lossless availability. The metadata and reviews for shows and recordings, even those not available for 
regular download, will remain available for those who maintain direct links. No filesets have been deleted from the 
Archive; certain items are simply not public now. Prior to our completing the changes, text files are easily referenced at 
a separate database. 

At this time, the Grateful Dead collection is not open to public uploads. The Grateful Dead Internet Archive Project 
(GDIAP) will continue its direct management of this collection for the time being. 

As far as we know, there has been no change to standard GD fan trading. It is common for bands to have policies that 
differ between fan trading, versus archiving here. 

What are the options for streaming a full recording? 

Hi-Fi: An MP3 playlist, readable by most players, that has the addresses of MP3 files encoded with a variable bit rate. 

Lo-Fi: An MP3 playlist, readable by most players, that has the addresses of MP3 files encoded with at a constant bit 
rate of 64 kilobits per second. These files are ideal for users with slower Internet connections. 

What are the options for downloading a full recording? 

Update 5/2006: Please note that due to a major system transition, many items' ZIP files (for their "Lossless" links) have 
been deliberately disabled for the time being. Engineers are still working on the best method for the new system. 

Lossless: A ZIP file containing Shorten files or Flac files. Unlike formats like MP3, lossless formats are true to the 
original - there is no degradation in quality. 

Hi-Fi: A ZIP file containing MP3 files encoded with a variable bit rate to deliver high quality at roughly 160kilobits per 
second. 

Lo-Fi: A ZIP file containing MP3 files encoded at a constant bit rate of 64 kilobits per second. These files are ideal for 
users with slower Internet connections. 

Other Web Options: All files are displayed as individual links on any item's details page. Web-based download 
managers can be set up to download all the files you want from the page, as a group. For Firefox . the extension 
DownThemAII is a popular option. 

Where can I see the rest of the 'Most Downloaded Items' in the Live Music Archive? 

To view the entire Live Music Archive (everything in the "etree collection") sorted by 'Most Downloaded Items' go to 
this link: http://www.archive.org/sea rch.php?querv=collection%3Aetree&sort=-%2Fmetadata%2Fdownloads 

And here's one that lists everything but the Grateful Dead (like the one on the LMA front page): 
http://www.archive.ora/search.php?auerv=collection%3Aetree%20AND%20NOT%20collection% 
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l&sort=-%2Fmetadata%2Fdownloads 

e the rest of the "Top Batting Averages' of shows in the Live Music Archive? 



For more information... 

Check out our Live Music Archive Forum 



The Internet Archive 

What's the significance of the Archive's collections? 

Societies have always placed importance on preserving their culture and heritage. But much early 20th-century media 
- television and radio, for example - was not saved. The Library of Alexandria - an ancient center of learning 
containing a copy of every book in the world - disappeared when it was burned to the ground. 

Special projects include OpenLibrary.org ( link to fag ) and NASAlmages.org: 

NASA Images was created through a Space Act Agreement between the Internet Archive and NASA to bring public 
access to NASA's image, video, and audio collections in a single, searchable resource. The NASA Images team works 
closely with all of the NASA centers to keep adding to the ever-growing collection at nasaimaaes.org . The site 
launched in July 2008 and now has more than 100,000 items online. 

What is the nonprofit status of the Internet Archive? Where does its funding come from? 

The Internet Archive is a 501(c)(3) nonprofit organization. It receives in-kind and financial donations from a variety of 
sources, including, but not limited to: Alexa Internet , the Kahle/Austin Foundation, the Alfred P. Sloan Foundation, the 
William and Flora Hewlett Foundation, and you. 

Does the Archive issue grants? 



How do I get assistance with research? How about research about a particular book? 

The Internet Archive focuses on preservation and providing access to digital cultural artifacts. For assistance with 
research or appraisal, you are bound to find the information you seek elsewhere on the internet. You may wish to 
inguire about reference services provided by your local public library. Your area's college library may also support 
specialized reference librarian services. We encourage your support of your local library, and the essentia! services 
your library's professional staff can provide in person. Local libraries are still an irreplaceable resource! 

What statistics are available about use of Archive.org? 



What software can play the downloaded movies? 



And, it 



; free! We also recommend MPIayer. 



st versatile player we've found for playing the wide variety of movies found in the Archive. 



For Windows: 

MPEG1 (VCD) most players; 

MPEG2 (DVD) freeware VLC . shareware player from http://www.elecard. 
http://www.apple.com/guicktime/products/mpea2Dlavback/ : 
MPEG4 quicktime6 from www.apple.com or VLC . Latest flash plugin for browsers. 



For Mac OSX and 9: 
MPEG1 (VCD) most players; 
MPEG2 (DVD) freeware VLC ( I 

http://www .a pple.com/quicktim e, 

MPEG-4 Quicktime6. Latest flash plugin for browsers. 



for-pay quicktime6 plugin: 



) the for-pay quicktime6 add-on (se 
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Some Mac users have written to us suggesting MPIayer (OS X), BBDEMUX, and MPEG2DECX - 



What are those 
animations associated 
with each movie and 



Encoding Parameters 



How can I make a DVD 



'e details, troubleshooting, and how to play movies on other operating systems, see this how-to page . 

> I get errors when I try to play a movie? 

The best all-around, free player is VLC Media Player - it handles most of the movie files you will find on this site. If 

e seeing errors when you try to play movies, please try downloading VLC and using that instead. This clears up 
many people's problems. 

Here are some other possible problems: 

1 . There is heavy traffic to oi 

2. You're behind a firewall ai 
firewall administrator. 

3. Your Internet connection went down or timed out. Check with your ISP or network administrator to see if there's 
a special policy about keeping a connection live. 

4. If your browser seems to hang after a "1 00% downloaded" message, check to see that you have sufficient hard- 
disk and TMP disk space. Rebooting the system sometimes helps. 

5. You are trying to play an MPEG-2 file on a platform other than Windows or Linux. At present, you need VLC 
( http ://www. videolan .org ) or the for-pay quicktime6 add-on to play MPEG-2 files on the Macintosh. Please 
contact us at info at archive dot org if you have information about other players that work on platforms other 
than Windows. 

6. 2. Your player tried to stream the movie, and it isn't streamable. Download the movie first, and then play it. 
(Right-click > Save As) 

7. 3. Some conflict exists between your computer's configuration and the player you're using. Unfortunately, 
because PCs can be set up in so many different ways and because different standards exist for playing video, 
finding a player that will work is a hit-and-miss process. Try Rod Hewitt's evaluations of a number of players. 

If you still have trouble, post your question to the moving images forum . 

Can I use these movies in FinalCutPro - in the Quicktime format? 

You can Re-encode Mpeg2 movies to quicktime for FinalCut Pro using Cleaners. 0.2 using the following settings. 
There is no de-interlacing, so you don't lose anything. The files increase in size 10 fold, so make sure you have 
jugh HD space. This procedure gives you quicktime movies suitable for use with final cut. 



Cleaner 5 - if you don't have 5.0.2, you 

- output > quicktime, .mov 

- tracks > process everything 
image > image 



in download. 0.2 from the t£ 



iinayc - unayc oito wnoua... iu 720*480, display Si 

encode > apple DV-ntsc codec, millions of colors, s 
Audio > we're still not sure about which is best, star 



do not deinterlace, field dominance-SHIFT DOWN 
ty 100%, frame rate, same as source 
o, 48kb, experiment. 



Some have had good results with their decoder cards, compare a few films done both ways on a good monitor with 
scopes and see which method is best. 

If you still have trouble, post your question on our discussion list ( moviearchive-subscribe(5)vahooqroups.com ) or write 
to us at info at archive dot org. 

-- NEW -- One of the simplest ways to transcode movies from MPEG-2 to DV format for editing is to use the freeware 
utility MPEG Streamclip (Mac OS X and Windows) available at squared5.com. It offers many settings and maintains 
video/audio sync. 

Sometimes when I play a movie, the video is choppy or very pixeiated. Why is that? 

Try downloading the movie to your computer and watching it locally. Sometimes choppiness occurs when we can't 
stream it to you quickly enough (because your connection is slow or our servers are overloaded). 

If you're watching an MPEG-4 that we derived from an original MPEG-2, we first reduce its size to 320 x 240 - a 
quarter of the resolution of NTSC video. We then translate it at 350 kbps, which is really borderline for that resolution. 
You see errors occasionally because there simply isn't enough bandwidth available, so the MPEG-4 encoder either 
drops frames - resulting in jerky or choppy motion - or drops macro blocks - resulting in blurred or pixeiated video. That 
is the price we pay for the small file size - 80 MB for a 1/2-hour clip is really very small in the digital video world. If this 
is the case, download the original MPEG-2 to solve the problem. 



Who owns the rights to these movies? 



This will vary from movie to movie. 

Many of the movies and collections are licensed with Creative Commons Licenses. Uploaders may designate whether 
or not an item has a CC License. If they do so, the Creative Commons logo will appear on the left hand side of the 
movie's detail page. Click on this logo to see details about the specific type of license that the uploader has assigned 
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to the movie. Archive.org cannot guarantee the accuracy of uploader-provided information. 

Some films may have the contact information listed for the filmmaker. If the information is provided, feel free to contact 
the filmmaker or organization the film comes from. 

Is there a discussion list about the movies? 



Yes — our list is about both movie content and technical issues. You can subscribe at moviearchive- 



Are there other similar archives on the Web? 



There are many sites that allow users to upload videos, but most of them only display very low quality video and/or do 
not let you download the videos. 

As far as we know, this is the only site that presents high-quality downloadable movie data files with such liberal use 
restrictions. See the Links page at Prelinger Archives for a number of sites that may be useful to researchers or those 
seeking specific films or footage. 

What are those animations associated with each movie and how did you make them? 

The animations on the details pages and on the browse pages are animated GIF files. In most cases, still shots from 
each minute of the program were grabbed and saved as JPG files (these are the thumbnails which you can reach by 
clicking on the "View thumbnails" links). Then a tool called ImageMagick was used to create the animated GIF files 
from the JPGs. 

We try to create an animated gif for every movie when it is uploaded (it may take a while to appear), but there are 
some file formats and/or encoding settings that make this difficult. If an animated gif hasn't appeared for your item by 
the day after you uploaded it, we probably couldn't make one for your item. 

Can I stream the movies? 



There are several programs you can use to stream movies in the Archive. Because we allow users to upload video 
files in any format, the same player will not always work for every single file, so it's a good idea to have a couple of 
programs available that you can try. Also, some files simply can't be streamed. Usually, this happens when the 
program that created the video file uses a codec that our software doesn't understand. So if you click on a stream link 
and get an "unsupported media" sort of error, use the download links instead. 

Here are some free players that might come in handy: 

Quicktime 

If you have Quicktime installed, many mp4 streaming movies will play right in your browser window just by clicking a 
stream (or download) link. Make sure you have the latest version so that you can play the widest array of files. 

VLC Media Plaver 

Open your VLC Media Player and go to File > Open Network Stream. Click the File tab and enter the download link of 
the file you want to watch. Yes, this seems backward, but it works! 

So, if you were trying to stream the movie Duck and Cover found at http://www.archive.org/details/DuckandC1951 you 
would: 

Use this URL: 

http://www.archive.org/download/DuckandC1951/DuckandC1951_256kb.mp4 
NOT this URL: 

http://www.arch i vc.org/3tream/DuckandC1051/DuckandC1051_25Gkb.mp4 

VLC will stream mp4, avi, mpg and other file formats, so it is quite useful for viewing the majority of the files in the 
archive. 



Real Player 

You can use Real Player to stream Real Media files. 

We support two bitrates: 32Kbps-1 92Kbps for modem and ISDN users plus 256Kbps-450Kbps for DSL and cable- 
modem users. 



Encoding Parameters 

We attempt DVD, VCD, and MP4 streaming for broadband. We want these parameters to easily work with low-end 
video editors. 

MPEG-2, DVD - 720x480 or 702x480 interlaced. With a system header on each pack to be compatible with DVD. 
(Prelinger movies are 1/2 D1 352x480 29.97 fps which causes some players to make them look skinny) 

MPEG-1 , VCD - Video Resolution SIF (352 x 288 
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PAL, 352x240 NTSC) 

Framerate 29.7 or 25 for PAL 

Video Compression MPEG-1 

Video Bitrate Up to 1151 kbps constant bitrate (CBR) 

Audio 224 kbit/sec MPEG-1 Layer2 

Stereo 44.1 khz 

Created with ffmpeg. 

MPEG-4 - 512Kbps h.264 VBR 320x240 video with 64Kbps AAC audio. Hinted for streaming. Created with ffmpeg 
and mp4creator. 

What is an editable file? 

An editable file is a file which can be downloaded and used in an editing program. The MPEG-4 are the highest bitrate 
versions we could do with the linux mpeg-2 to mpeg-4 conversion tools we use. These files can be read directly into 
FinalCut-Pro from Apple, and can be converted to mov using Quicktime-pro and read directly into iMovie from Apple. 

How do I make DVD's from Internet Archive movies? 

Please read this forum posting about how to create DVDs from many of the movies found in the Archive: 
http://www.archive.org/iathreads/post-view. php?id=26467 . If you have further information to add, please email us . 

How can I make a DVD using linux? 

An Archive user sent in the following instructions for creating DVDs on a linux system: 

To do this under linux from the command line: This requires a few common programs. Using any modern 
package distribution of linux installing these should be quite simple. 

• mplayer (http://www.mplayerhq.hu/) 

• transcode (http://www.transcoding.org) 

• mjpeqtools (http://mjpeg.sourceforge.net/) 

• dvdauthor (http://dvdauthor.sourceforge.net/) 

1 . The first command copies just the video out of input. mpeg and produces output.video: 
mplayer input.mpeg -dumpstream -dumpfile /dev/stdout | tcextract -t vob -a 0 -x mpeg2 > 
output.video 

2. The second command copies just the audio out of input.mpeg and produces output.audio: 
mplayer input.mpeg -aid 128 -dumpaudio -dumpfile output.audio 

3. The third command combines the video and audio back together again in a format ready for 
dvdauthor: 

mplex -f 8 -V -o complete.vob output.video output.audio 

4. This step creates the dvd structure. Create a new file with any text editor with the following: 
<dvdauthor dest="DVD_folder"> 

<title 9 set> 

<titles> 

<pgc> 

<vob file="complete.vob" chapters="0,15:00,30:00,45:00,60:00'7> 

</pgc> 

</titles> 

</titleset> 

</dvdauthor> 

The chapters line lists the points to include chapter marks on the DVD for jump navigation. 

5. Now let dvdauthor create our dvd: 
dvdauthor -x dvdauthor.xml 

Done! You should now have a folder called "DVDJblder" with your movie. You can create an ISO or BIN 
image with mkisofs: 

mkisofs -dvd-video -V "Movie Title" -o movie.iso DVD_folder/ 

You can play movie.iso in most any video player or burn it to a DVD: 
growisofs -speed=16 -dvd-compat -Z /dev/dvd=movie.iso 

If you just want to burn the film to a DVD you do not have to create the movie.iso image file: 
growisofs -speed=16 -dvd-video -dvd-compat -V "Movie Title" -Z /dev/dvd DVD_folder/ 

Can I upload this movie? 

You may upload movies that you own the copyright to, or that are in the public domain. 

We are not copyright lawyers, and copyright is a tricky business, so you may want to consult a copyright researcher to 
clear material before you use it. You may also want to check this list of movies that one of our volunteers has already 
researched. 
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Here is some general information on the subject that may help you decide if your movie is okay to upload. The 
information below applies to films produced in the United States only. 

1 ) Is there a copyright notice visible in the film? It is usually visible with the title or at the end of the film. 

If the work was made in 1923 or earlier, it is probably public domain and can be uploaded. NOTE! Restored 
versions of the film or new soundtracks for silent films can have more recent copyrights that are still valid - usually a 
copyright notice for a new soundtrack or restoration will appear in the film. 

For works made from 1923 to 1949, post a question to the movie forum on this site before you upload. The copyright 
could have been renewed and there isn't a way online to check a film's copyright status. 

For works made from 1950 to 1963, you can check the title at the Library of Congress Copyright Database for 
copyright renewals: http://www.copvriqht.QQv/records/cohm.html . This will list copyright renewals for most films. 

If the copyright notice is 1964 or later, the copyright is probably still valid and the film should not be uploaded unless 
you are the copyright holder. 

2) Is the copyright notice in the correct format? It needs to state three things - the word 'copyright' or the copyright 
symbol or '(c)', the year and who owns the copyright? If it is missing one of those elements or if there is no notice, it 
could be public domain. If you aren't sure, please post a question to the movie forum on this site. 

3) Is the film foreign (not from the U.S.)? Foreign titles might not have a copyright notice, but still may be 
copyrighted in their country of origin. Traditionally the U.S. wouldn't recognize the copyright of a foreign film unless it 
was registered in the U.S. That has recently changed with the GATT treaty. Many foreign works had their copyrights 
restored. Please post a question to the movie forum on this site about these films before you upload. 

What kind of movie file should I submit? 

The archive is all about free access to information, so you should submit file formats that are easily downloadable 
and/or streamable for other site patrons. 

We prefer that you submit the highest quality format that you have available, and then we will attempt to create smaller 
file sizes and formats automatically with our deriver program. MPEG2 files are the easiest file type for us to deal with. 
We recommend that you do not attempt to do any special encoding of your files - the more settings you mess around 
with, the less likely our deriver code will be able to process the file. 

Whatever format you choose, please upload each file to your item individually, in a non-compressed format. Uploading 
content in a .zip or .rar file makes your item unstreamable and significantly less accessible to others. If you upload .zip, 
.rar, non-video formats (like .exe), or password-protected files, they may be removed by our moderators. 

The table below describes what file formats we will attempt to derive depending on what type of file you submit. 
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This is automatically generated. 
NOTE: inner whitespace is significant. 



Derivatives for Movies Items 





. . . then we will try to derive the following formats: 


If your source file is format: 


512Kb MPEG4 


Animated GIF ! Ogg Video 


Thumbnail 


3GP 


" 






56Kb QuickTime 




••~ " ■ f 




64Kb V1PEG4 








64Kb QuickTime 




i 




256Kb MPEG4 


"-• " 


— t 




256Kb QuickTime 




t 




Cinepack 








DivX 


" 


[ - 




DV Video 








Flash Video 








: h.264 V1PEG4 








ISO Image 








IV50 








Matroska 








Motion JPEG 








MPEG1 








MPEG2 








MPEG4 








Ogg Theora 








QuickTime 








Real Media 








Windows Media 








1 1 embed a flash player with my movie on my web 


page? 



It's really easy to embed our flash player with your movie into your web site. To do so, go to the item page for the 
movie you want to embed. Then click the flash player as if to watch the movie. When you do, you'll see a small 
question mark beneath the player. Click on this and you'll get the instructions and code you need to embed the movie 
into your web page. 

For more information... 

Check out our Moving Images Forum 




Downloading Content 



Can I download files via FTP? 



For more information, please see the discussion forum: 
httB ://www.archive.org/iathr eads/post-view.php?id=240921 
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What are some good FTP clients? 

Update (2009April): Please note that to allow more flexibility on access, we are discontinuing FTP read access. 
HTTP read access (as in downloading through your web browser), remains more popular with users, and shall 
continue. 

FTP can yet be very U3cful for your up l oads. 



• Filezilla (support open source!) 
. SmartFTP 

• FTP Commander 

For Mac Users 

• Filezilla (support open source!) 

• Cyberduck (support open source! 

• Transmit 



How do I download files? 



r "Save Link As" (or 

snu comes up, select "Save 



If I remove my 
acco unt, will my items 
also be removed from 
the Archive? 



When I attem pt to 



in using my username 
and password. I am 
told that the username 
or password is invalid. 
What could be wrong? 



W hat is the difference 
between a virtual 
lib rary card and an 
account? 



How do I change my 



What happens to my 



f orum posts and 
movie, software, 
audio, and book 



What happens if mv 



How can I remove rr 



Virtual Library Cards (AKA Accounts) 

If I remove my account, will my items also be removed from the Archive? 



:e your account. If you would like your items removed, please 



I forgot my password, what c; 



I do? 



As long as you remember the email address which you originally used when signing up for your virtual library card, you 
can use this form to have your password emailed to you. Bear in mind that your password will be sent in clear text, 
which means that anyone who views the email (or anyone with sophisticated "packet sniffing" software) can obtain 
your password. For this reason you should return to the Internet Archive website once you have your old password 
and change it to something ne w. 

n told that the username or password is 

There are several things to keep in mind when you encounter this error. 

• Your username is your email address, not your screen name. Make sure you enter the same email address that 
you supplied when signing up for your virtual library card. 

• Your password is case-sensitive. Check to see if the CAPS-LOCK key is engaged (typically a light would be 
illuminated on your keyboard). 

• You might have forgotten your password. If you think this is the case, you can have your password emailed to 
you here 

What is the difference between a virtual library card and an account? 

These two terms are used interchangably. 

How do I change my password? 

You can use this form to change your password. 

How do I change my screen name? 

You can use this form to change your screen name. 

What happens to my forum posts and movie, software, audio, and book reviews when I change my screen 
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account? 


name? 


What is an Open ID? 
Do 1 have to register 
for one to use 
Archive.org? 


Your old reviews and posts will be updated with your new screen name. 

What happens if my email address changes? How can 1 change my email address? 


My account is locked. 


You can use this form to change your email address. 


What can 1 do? 


However, be aware that if you change the email address for your account, you will no longer be able to "edit" files 
posted from your old email address. If you would like to have your items' ownership transferred to a new email 
address, send an email to info AT archive DOT org from your OLD email address (the one you want to get rid of - 
that's how we know you own the items) and tell us which address you'd like to change it to. 






How can 1 remove my account? 




You can use this form to remove your account. 




What is an Open ID? Do 1 have to register for one to use Archive.org? 




For what an Open ID is and how you can use it, see http://openid.net 

An Open ID is not required to obtain a library card (account) for Archive.org 




My account is locked. What can 1 do? 




It is likely that your account was locked because you uploaded multiple items that seemed to have rights issues or the 
content you uploaded was inappropriate for the Archive, if you do have rights to the content you uploaded and you 
believe it is appropriate for Internet Archive, please contact us with your thoughts at info AT archive DOT org. 


Questions 

Why not Squid or 

mod proxv? 

Why FreeCache? 

Why not BitTorrent? 

What files are beina 
served by FreeCache? 


FreeCache 

Why not Squid or mod_proxy? 

Both Squid and mod__proxy are great for reducing the load on web servers, and we encourage everybody to use them. 
The disadvantage of these caching proxies are that they only work "vertically", i.e., they reduce the bandwidth 
downstream from the originating web site to the users' browsers. That web site still gets 1 download per (non- 
cascading) proxy. The FreeCache system works more "horizontally", i.e., FreeCaches fill themselves up from 

caching proxies are complementary technologies. Both can be used to reduce the impact on web sites. 




Why FreeCache? 


What's a good 


FreeCache is a demand-driven, distributed caching system. Cooperating caches exchange files without burdening the 




original site too much. 




Why not BitTorrent? 




BitTorrent clients for this balancing; these clients often become un-available after a particular file is not popular 
anymore. The FreeCache system utilizes permanent FreeCaches that don't go away (although particular files get 
flushed out after a while). Unlike BitTorrent, the FreeCache system is transparent to the end-user. No new client or 
server software is required, and the files do not need to be converted. To offer a file via the FreeCache system, all you 
need to do is prefix the URL with http://freecache.org/ 




What files are being served by FreeCache? 




FreeCache can only serve files that are on a web site. If the link to a file on that web site goes away, so will the file in 
the FreeCaches. Also, there is a minimum size requirement. We don't bother with files smaller than 5MB, as the saved 
bandwidth does not outweigh the protocol overhead in those cases. 




What's a good download manager? 




We like wget, because you can tell it to play nice and go slow. It's highly configurable and very powerful. Wget runs on 
all Unix platforms (incl. Mac OS X), and it comes standard with Cyqwin on Windows. If you prefer something graphical, 
Mozilla's built-in download manager works fine. 






Questions 


DocuComp 


What is DocuComo? 


What is DocuComp? 


What do 1 need 1 to 
know to use 
DocuComp in the 


DocuComp is a sophisticated technology that compares inserted, deleted, replaced and moved text and content in 
Web pages. It's patented algorithm has been specially designed and licensed for use in the Wayback Machine. 
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What do I need I to know to use DocuComp in the WayBack Machine? 

You only need to know the basic functions of the Wayback Machine. Begin by typing an URL into the Wayback 
Machine and hit the 'Take Me Back' button. Once you've found your choices on the results page, click the 'Compare 
Archive Pages' button in the upper right hand corner of the page. The reloaded page will have a series of check-boxes 
before each page date. Check any two dates and select the 'Compare two dates' button in the upper left-hand coi 
of the screen. The system is designed to automatically generate results for any URL's indexed by the Wayback 
Machine. 

What Archive Pages are comparable? 

in compare any two pages from the Archive's library dating from 1! 



:o the present (approximately 55 billion 



pages). 

Why should I compare results of past Web pages? 

Access to the Archive's Collections is provided at no cost to you and is granted for scholarship and research purposes 
only. The DocuComp feature is intended to provide interesting insight into how content on pages in every field- from 
the government to entertainment to business sites- changes over time. 

Where can I find out more about DocuComp? 

Please visit the ww.docucomp.com site. DocuComp is a widely-used technology that is licensed by it's parent 
company, Advanced Software, into many of the software products and content management systems available today. 
Formerly a standalone application for Advanced Software, the company now focuses exclusively on licensing the 
DocuComp technology and patent to software vendors. 

How are images compared? 



Some images are missing in my comparison? 

srtain cases, images within the Web pages are not available. Not all images are archived nor are retrievable from 
the original site. If they no longer exist on the original site then the images will not be available and not displayed within 
the archived pages. 

Certain links or actions are not working in the comparison results? 



How can I report problems? 

After comparing two pages, the upper frame on the results page includes a hyperlink to report results which return any 
page faults. By clicking this hyperlink, an automatic error report is generated to both the Internet Archive webmaster 
and DocuComp's technical team. If you wish, there is an additional help screen to describe the issue. Please keep in 
mind that with over two billion pages to index and compare, not all being created alike; some pages will differ greatly 
not have a common frame of reference to effectively compare. 

Can I copy and use my results? 

The results of any comparison done on the Internet Archive site are governed by the terms of use listed at: 
http://www.archive.org/about/terms.php . Additionally, any use of the DocuComp trademark or logo without express 
written permission by Advanced Software, Inc and any of it's affiliates is prohibited by law. 

Guidelines for Press, Magazines and General Media 

DocuComp is a registered trademark of Advanced Software, Inc. Please contact the company at (866) 329-7480 or 
info(a )docucomp.com for background information on the company's history, technology data, 
interviews. 



Preiinger Movies 
How did you digitize the films? 

The Preiinger Archives films are held in original film form (35mm, 16mm, 8mm, Super 8mm, and various obsolete 
formats like 28mm and 9.5mm). Films were first transferred to Betacam SP videotape, a widely used analog broadcast 
video standard, on teiecine machines manufactured by Rank Cintel or Bosch. The film-to-tape transfer process is not a 

time process: It requires inspection of the film, repair of any physical damage, and supervision by a skilled 
operator who manipulates color, contrast, speed, and video controls. 
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The videotape masters created in the film-to-tape transfer suite were digitized in 2001-2003 at Prelinger Archives in 
New York City using an encoding workstation built by Rod Hewitt . The workstation is a 550 MHz PC with a FutureTel 
NS320 MPEG encoder card. Custom software, also written by Rod Hewitt, drove the Betacam SP playback deck and 
managed the encoding process. The files were uploaded to hard disk through the courtesy of Flycode. Inc . 

More recently, Prelinger films have been digitized and uploaded by Skip Elsheimer at AV Geeks . 

The files were encoded at constant bitrates ranging from 2.75 Mbps to 3.5 Mbps. Most were encoded at 480 x 480 
pixels (2/3 D1) or 368 x 480 (roughly 1/2 D1). The encoder drops horizontal pixels during the digitizing process, which 
during decoding are interpolated by the decoder to produce a 720 x 480 picture. (Rod Hewitt's site C oolstf shows 
examples of an image before and after this process.) Picture quality is equal to or better than most direct broadcast 
satellite television. Audio was encoded at MPEG-1 Level 2, generally at 112 kbps. Both the MPEG-2 and MPEG-4 
movies have mono audio tracks. 

To convert the MPEG-2 video to MPEG-4, we used a program called FlasK MPEG. This is an MPEG-1/2 to AVI 
conversion tool that reads the source MPEG-2 and outputs an AVI file containing the video in MPEG-4 format and 
audio in uncompressed PCM format. We then use a program called Virtual Dub that recompresses the audio using the 
MPEG-1 Level 3 (MP3) format. This process is automated by the software that runs the system. 

An article on re-coding Prelinger Archive films to SVCD so you can watch them on your DVD player. 

See archived version of www.moviebone.com/ 

Do I need to credit the Internet Archive and Prelinger Archives when I reuse these movies? 

;e of archival material, in order to help make others aware of this site. We suggest 



Archival footage supplied by the Internet Moving Images Archive (at archive.org) 



"Archival footage supplied by archive.org" 

Do I need to inform the Internet Archive and/or Prelinger Archives when I reuse these movies? 

No. However, we would very much like to know how you have used this material, and we'd be thrilled to see what 
you've made with it. This may well help us improve this site. Please consider sending us a copy of your production 
(postal mail only), and let us know whether we can call attention to it on the site. Our address is: 

Rick Prelinger 

c/o Internet Moving Pictures Archive 
PO Box 29064 
San Francisco, CA 94129 
United States 



:o these movies on videotape or film? 



Archive Films/Archive Photos 
75 Varick Street 
New York, NY 10013 
United States 
+1 (646)613-4100 (voice) 
+1 (646)613-4140 (fax) 
+1 (800) 876-51 15 (toll free in the US) 
sales@archivefilms.com 

Please visit us at www.prelinaer.com/prelarch.htmi for more information on access to these and similar films. Prelinger 
Archives regrets that it cannot generally provide access to movies stored on this Web site in other ways than through 
the site itself. We recognize that circumstances may arise when such access should be granted, and we welcome 
email requests. Please address them to Rick Prelinger . 

The Internet Archive does not provide access to these films other than through this site. 
What parameters were used when making the Real Media files on the website? 
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Rod Hewitt posted some very useful information here 

Are there restrictions on the use of the Prelinger Films? 

There are no restrictions. You are warmly encouraged to download, use and reproduce these films in whole or in part, 
in any medium or market throughout the world. You are also warmly encouraged to share, exchange, redistribute, 
transfer and copy these films, and especially encouraged to do so for free. 

Any derivative works that you produce using these films are yours to perform, publish, reproduce, sell, or distribute in 
any way you wish without any limitations. 

Descriptions, synopses, shotlists and other metadata provided by Prelinger Archives to this site are copyrighted jointly 
by Prelinger Archives and Getty Images. They may be quoted, excerpted or reproduced for educational, scholarly, 
nonprofit or archival purposes, but may not be reproduced for commercial purposes of any kind without permission. 

If you require a written license agreement or need access to stock footage in a physical format (such as videotape or a 
higher-quality digital file), please contact Getty Images . The Internet Archive does not furnish written license 
agreements, nor does it comment on the rights status of a given film above and beyond the Creative Commons 
license. 

We would appreciate attribution or credit whenever possible, but do not require it. 
Can you point me to resources on the history of ephemeral films? 
See the bibliography and links to other resources at www.prelinger.com/ephemeral.html . 
Why are there no post-1964 movies in the Prelinger collection? 

Because of copyright law. While a high percentage of ephemeral films were never originally copyrighted or (if initially 
copyrighted) never had their copyrights properly renewed, copyright laws still protect most moving image works 
produced in the United States from 1964 to the present. Since the Prelinger collection on this site exists to supply 
material to users without most rights restrictions, every title has been checked for copyright status. Those titles that 
either are copyrighted or whose status is in question have not been made available. For information on recent changes 
in copyright law, see the circular DuraJlojiofJJcjjyjigM (in PDF format ) published by the Library of Congress 

For more information... 

Check out our Prelinoer Archives Forum 



Search Tips 

Can I see a list of the most downloaded movies? 



Can I see a list of the most downloaded audio files? 



. All Audio Items (not including Live Music Archive) . 
. ALL Live Music Archive concerts 
. LMA concerts (without the Grateful Dead) 
. Grateful Dead only 

1 1 search by Creative Commons License? 

Yes, you can. But it's a little complicated. 



/metadata/licenseurl:http*abbreviation/* 



If you want to use this in combination with other queries, like "I want by-nc-nd items about dogs" you'd do 
this: /metadata/!icenseurl:http*by-nc-nd/* AND dog And you'd get 195 items. The AND tells the search engine all the 
is returned should have that license AND they should contain the word dog. AND has to be in all caps. 
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Just to make it easier, here are the basic searches: 



• Public Domain 

• Attribution Non-commercial No Derivatives (by-nc-nd) 

• Attribution Non-commercial Share Alike (by-nc-sa) 

• Attribution Non-commercial (by-nc) 

• Attribution No Derivatives (by-nd) 
. Attribution Share Alike (by-sa) 

. Attribution (by) 



want to add LOTS of 
ndividual items to the 
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Can you tell me a bit 
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files U i movies ! 
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Uploading Content 

How can I add my music, movies, or text? 

You may contribute content to the Internet Archive if it's in the public domain or if you own the rights to it. If you own 
the rights, we recommend that you choose a Creative Commons license for it so that others will know how they may 
(or may not) use it. You can choose a type of Creative Commons license during your upload process. 

Please note that if you wish to be contacted with inquiries regarding your item, you'll need to supply public contact 
information. Some chose to provide a web address, mailing address, or other means of contact in the description text 
for the item. 

See also http://www.archive.Org/about/faqs.php#Rights 



For books, please see http://www.archive.Org/about/faq s.php#195 
How does the Share button work? 



To use the new beta uploader: 



e upper right-h; 



i corner of the site, or click hgre. 

it to upload. You can select more than 01 



3 file, or you 



• First click the "Upload" button near t 

• Now you can see the Share button. 

• Click the Share button to browse for the media 
can click the Share button again to select addit 

• Archive.org will automatically detect which media collection (movies, audio, texts, or other) your item belongs to, 
according to the type of the first uploaded file. 

• You also have the option to click the link to change the file type if needed. 

• As the file(s) upload, enter the information about your file in the given fields. 

• When everything is complete, click the "Share my File(s)" button at the bottom of the page to create your item 
page on Archive.org. 

You can track the progress of your items in our catalog . 

We accept audio, video, and text files. 

I want to add LOTS of individual items to the archive, how do i do that? 

If you have a large collection of related items in single media type, like a radio show for example, please contact the 
Internet Archive. You can email our collections staff at info at archive.org. Please put start your subject line with 
"Collections:". 



How can I report ai 



How can I make changes to my item? 
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If you want to change your item's metadata (like title, description, file formats and titles, running time, language, 
etc.), or change the files in your item (remove files, upload new/more files, rename files, etc.), you can do this using 
the new "Edit Item!" link. Here's how: 

• Make sure you're logged in with the account you used to upload the item 

• Go to your item's details page 

. Click the "Edit item" link in the lower left box. 

• Select the "change the information" link 

Your changes will appear in 20-30 minutes. 

If you have uploaded new files and you want us to make derivative files (smaller, more compressed versions), you will 
need to do one more thing. 

. Click "Edit item" 

• Select the "change the information" link 

• Click "Item Manager" 

. Click the "derive" button 



How can I take my files off the site? 

http://www.archive.Org/about/faqs.php#264 

If you would like us to take down an item you have posted, please send an email to info [AT] archive [DOT] org. Please 
include the exact URLs of the items. Your email must come from the same email address you used to upload the item. 
This is the only way we can tell that you are the owner of the item. 

Can you tell me a bit more about choosing a license? 

From the Creative Commons website: "Creative Commons licenses help you share your work but while keeping your 
copyright. Other people can copy and distribute your work, but only on certain conditions." 

You can choose a license to associate with your contribution and this license will be linked to when users see the 
details page. 

How should I name the files for movies I upload? 

Take for example a movie called My Home Video. The identifier (AKA base name) for this movie should be something 
like MyHomeVideo. The naming convention for the files depends on the encoding. 

MPEG-2: 

MyHomeVideo. mpeg 
MPEG-1: 

MyHomeVideo. mpg 
DivX: 

MyHomeVideo.avi 

QuickTime: 
MyHomeVideo. mov 

Windows Media: 
MyHomeVideo.wmv 

Real Media: 
MyHomeVideo. rm 

MPEG-4: 

MyHomeVideo. mp4 

If you know the bitrate of the encoding (for QuickTime, Windows Media, Real Media, or MPEG-4), please include in 
the file name as such (using 64 as the bitrate and QuickTime as the format, for example): 

MyHomeVideo_64kb.mov 

During upload, I get an error message about 'illegal characters' or 'file name prohibited.' What does this 
mean? 

The folder or files that you are attempting to upload have characters in the name that cause problems with the system 
- so we have designated them "illegal". This includes the following characters in the name: 



*(){}[]/\$%@# A &l<> , -'!? + 
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In addition, files and folders may not have spaces in their names. 

You will need to remove any of these illegal characters by renaming the file(s) in order for the system to accept your 
contribution. 

What languages are supported by Archive.org? How can I use accented or special characters in my title or 
description? 

What languages are supported by Archive.org? 

Archive.org supports all metadata about items in just about any language so long as the characters are UTF8 
encoded. 

(1 ) example of language:korean 

htt p://www.archive.org/details /Shall We Protest the Candlelight_Documentary-iso 

(2) example of language: Arabic 

http://www.archive.org/details/ktb_tragm„,rgal_pdfbook..ara 
Filename support: 

Support for Filenames is limited to pretty basic ASCII characters, like 
A-Z 

a-z 
0-9 



Additional character support for filenames is not an area under development at this time. 
How can I use accented or special characters in my title or description? 

You can use accented and other special characters in your item text and file titles, but you need to make sure 
you use the xml-safe code for those characters instead of typing them directly into the forms. 

Typing accented characters directly into forms can break the xml for your item, making your files unavailable 
through the site. 

Instead, you'll want to use a special code to represent those letters. There are some examples in the table 
below, but you can find a complete listing of these codes on 

http://en.wikipedia.org/wiki/List of XML and HTML character entity references - you'll use the number in 
parentheses in the "Unicode code point" column. 

Here are some common accented and special characters and what you should replace them with: 



To Make This Character. 



Replace It With This Code 



So to write the word cafe you would actually write cafe - you replace the letter e with the code e 

There are many, many more codes than the ones listed above, of course. You can find more at 
http://en.wikipedia.oro/wiki/List of XML and _HTML character entity .references . 

What kinds of formats do you want me to use for uploading? 
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The Internet Archive strives to archive content in open formats that are friendly to long-term storage and access. In 
addition to affecting long-term storage and access, giving us media in these formats will assure that they are 
accessible now, since many problems with long-term accessibility such as DRM and propriatary codecs also cause 
problems today. 

However, if you have content that is not available in an open/recommended format (see below), we will still happily 
archive it. Our systems are not tied to specific media formats and in fact are capable of archiving any type of digital 
data that can be represented as a file. 

Format Recommendations: 

We encourage users making contributions to the Archive to create as high quality versions of their media as possible. 
As we know access is important and not everyone has a high speed connection, we will take these archivable copies 
and create much smaller version for users with slow connections. Remember, a WAV file may seem big, but it won't be 
in 5 years. Further, you can always make lower quality files (e.g. mp3s) from higher quality files, but cannot go the 
other way. 

For video we typically recommend MPEG2 (DVD quality), or if you do not have MPEG2, MPEG1 or MPEG4. 
For audio we recommend WAV or FLAC (preferably 24 bit). 
For text we recommend plain text, xml, or pdfs. 
How should I name the audio files I upload? 

Take, for example, an audio called My Music. The identifier for this audio should be something like MyMusic. The 
naming convention for the files depends on the encoding. 

MP3: 

MyMusic. mp3 

WAVE: 
MyMusic.wav 

Flac: 

MyMusic.flac 

Shorten: 
MyMusic.shn 

Ogg Vorbis: 

Windows: 
MyMusic.wma 

Real Media: 
MyMusic.ra 

If you know the bitrate of the encoding, please include it in the file name. For example: 

MyMusic_64kb.mp3 

How can I take my files off the site? 

If you would like us to take down an item you have posted, please send an email to info [AT] archive [DOT] org. Please 
include the exact URLs of the items. Your email must come from the same email address you used to upload the item. 
This is the only way we can tell that you are the owner of the item. 

I just uploaded my files, and I got an error message that says there's a problem with my metadata - but I 
haven't added any metadata yet! 

When you create an item, we "check out" a directory for you to upload files into. When you're done uploading, you 
"check in" the directory (by cicking a link on the check out page, or clicking the "click here when done" icon). 

Checking in an item lets us know you're done uploading, and the first thing we do is back up your files to a second 
server (so we'll have two copies of everything). Sometimes, when it's taking longer than usual to complete this backup, 
you'll get an error message that says there's a problem with your metadata. If you wait a little while (usually just a few 
minutes, but occasionally longer), you should be able to continue the upload process without any trouble. 

If you uploaded metadata with your files, or you've gotten this error after you've added metadata (title, description, file 
titles, etc.) then you may have a problem. Usually an item breaks because you used special characters that broke the 
xml files for your item. Please feel free to use the link on the error page to report the problem to us and we'll try to help 
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you fix it. 

What is the relationship between Internet Archive and OurMedia? 

The OurMedia collection on archive.org can be found at http://www.a rchive.org/details/ourmedia. Users can upload to 
this section directly from the OurMedia site on this page. If you have questions or concerns about your item(s) in 
OurMedia, please contact them directly - 



clickable in my posts? 

How can I fori 
in my posts 

How do I 



1 1 make links clickable in my posts? 

You may have noticed that some posts have highlighted links in them. Internet Archive forums permit the use of HTML 
codes. Suppose you want to make a link to the Internet Archive home page, one that looks like this: Internet Archive 
home page . To do this, you would enter the following HTML code: <a href="http://www.archive.org">lnternet Archive 
home page</a>. 



How can I format text in my posts 



How do I subscribe/unsubscribe to a forum email list? 



How can I get a node? 
If I get a node, can my 



How can I connect to SFLan? 

With a laptop: Be in the vicinity of a SFLan node. Associate with it: The SSID is sflanNN, where NN is the number of 
node, e.g. sflanl 3. No WEP. You'll get an IP number assigned via DHCP. With a house: Contact us at info at archive 
dot org. (Please include your address and a phone number.) Find out if you have line of sight to another SFLan node, 
buy a node, and we'll put it on your roof. 

I live at 123 Main St at Crossing; do I have line of sight access to a node? 



What is the cost of a node? 



What are the power 



What is the percentage 
of uptime? 

What about IP 



I still have more 
q uesti ons, what 
should I do? 



How can I get a node? 

Send an email with your name, exact address and phone number to info at archive dot org. Be sure to write "SFLan 
node" (or something similar) in the subject line. The information will be passed on to our fantastic installation team who 
will contact you. 

If I get a node, can my neighbors connect also? 

Yes, a SFLan node can connect your neighbors and co-condo association members. 
What is included in the node? 

Most of our nodes are composed of two radios, but some have three. The components are in a weather tight box with 
a four foot coax cable and two antennas attached. The whole unit is mounted on your roof (generally) on a pole. There 
is a picture of our lovely 5'3" spokesmodel holding one here: http://www.archive.org/iathreads/uploaded-files/AstridB- 
PICT0017.JPG 

What are the power requirements of a node? 

A node takes on average 5 watts. 

What are the connection characteristics of the network? 

There are no average characteristics, but 2MBs shared among 20 oi 



o people would be an example. 
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What is the percentage of uptime? 

SFLan is an experimental network, so the uptime varies. Right now uptime averages around 90% or more. 
What about IP addresses? 

SFLan uses real, routable IP addresses. These are usally given out dynically via DHCP. The nodes themselves use 
static addresses. We can also assign static addresses for servers. For the techies: We use tunneling, layer 2 and layer 
3 bridging in parts on the network to make it all appear as a "flat" LAN. There are pros and cons about this approach. It 
has worked best for us so far. However, it is a moving target, and might change in the future. 

I still have more questions, what should I do? 

■e questions, try the SFLan forum. If you still need help, write to info at 



Report Stem 

How do I report that there's an issue with an item? 

The Internet Archive (Archive.org) is a nonprofit library that preserves di 



a million u: 



a day with the goal of universal ai 

st Archive's Terms of Use , please send an email with the URL (web 



The Internet Archive follows the Oakland Archive Policy for Managing Removal Requests And Preserving Archival 
Integrity. (When reviewing the Oakland Archive Policy, please note that information about requests coming from 
webmasters is information to assist with archived websites in particular.) 
For more information, see htt p://www.archive.Org/ab out/faqs.php#Rights. 

How do I report that something's wrong with a book? 

The Internet Archive strives for fidelity with its sponsored scanning for Contributing Libraries. 
If you see an error, we'd appreciate knowing about it! 



To share additional information like this, you may wish to post it using the option to write a review of a book. 



There's a problem with the item - what next? 

Some changes to our system, to individual items, or to collections can take a day to appear on Archive.org. If you're 
experiencing a problem with an item, we recommend trying again after a day. Often the issue will then have already 
been resolved. 

How can I take my file off the site? 

If you would like us to take down an item that you have uploaded, please send an email to info -at- archive.org 
Please note that you need to include the URL (web address) of the item. 

all address you used to upload the item. This is the only way we can tell that 

As always, if you write in, please be sure any spam filter you have is set to accept email from ©archive. org. 
Please see also the further resources at http://www.archive.Org/about/faqs.php#Uploadinq Content 
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3t be done with the item. This 



Arohive.org cannot advise on the potential licensing status of uploaded items. 



You may find these resources helpful 
CreativeCommons.org 
Chilling Effects Clearinghouse 
Electronic Frontier Foundation 



Who owns the rights to these movies? 

http://www.archive.Org /about/faqs.php#49 

Are there restrictions on the use of the Prelinger Films? 

http://www.archive.Org/about/faqs.php#197 

How do I find out information about use of NASA images? 

Can I search Archive.org by Creative Commons License? 

http://www.archive.Org/about/faqs.php#263 
What is non-Commercial Use? 

What is.non-Commercial Use? Please see http://www.archive.orq/iathreads/post-view.php?id=1 1 1 591 
A link the Terms of Use for Archive.org is at the bottom of each page. 
How can I contact the person / group who uploaded an item? 

Internet Archive is unable to release any contact information for patrons. However, it may be worth your while to post a 
review for the item in question - this automatically sends an email to the email address associated with the uploader's 
account, notifying them that their upload has been reviewed. You could pose your queries/requests for information 

Equipment 

What equipment does the Internet Archive use? What APIs? 

Storage systems used by the Internet Archive: 

> Large Scale Data Repository: Petabox htt p://www . petabox.org 

. Datacenter in a shipping container - Internet Archive launch with Sun 

Equipment and software used in the Internet Archive's scanning and OCR services for Contributing Libraries 



Documents describing how to use Archive software and services, maintain "special" servers, and so on. Includes our 
API to archive.org services using JSON format. 

• http://www.archive.org/help 
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Archive-It 

What is Archive-It? How can I use it? 

The Internet Archive Wayback Machine is a service that allows people to visit archived versions of Web sites. 
Visitors to the Wayback Machine can type in a URL, select a date range, and then begin surfing on an archived 
version of the Web. Imagine surfing circa 1999 and looking at all the Y2K hype, or revisiting an older version of your 
favorite Web site. The Internet Archive Wayback Machine can make all of this possible. 

What is Archive-It? 



Archive-It is a subscription service that allows institutions to build and preserve collections of born digital content. 
Through the user-friendly web application, Archive-It partners can harvest, catalog, manage, and browse their 
archived collections. Collections are hosted at the Internet Archive data center and are accessible to the public with 
full-text search. 

Why would I subscribe to Archive-It instead of using the Wayback machine at Internet Archive? 

Subscribers to this service can create distinct Web archives called "collections", containing only the born digital 
content they are interested in harvesting, at whatever frequency suits their needs. All collections are full-text 
searchable. The collections created with Archive-It can be cataloged and managed directly by the subscriber. We keep 
a minimum of two copies of each collection online. These Archive-It features are currently available in the General 
Archive at www.archive.org. 

How frequently can I archive Web sites? 

Archive-It is very flexible: you can harvest material from the Web using nine (9) different frequencies, from daily to 
annual. Subscribers can select different crawl frequencies for each chosen URL. Additionally, your institution can also 
chose to start a crawl "on demand" in the case of an unforeseen spontaneous or historic event. 

Who gets access to the collections created in Archive-It? 

By default, all collections are available for public access from the main page at www.archive-lt.org. However, a 
subscriber can choose to have their collection(s) made private by special arrangement. 

How can I search the collections? 

Archive-It provides full text search capability for ai! public collections. You can also browse by URL from the list 
provided for each collection. The public can browse and search collections by partner type or collection from 
www.archive-it.org. 

What types of institutions can subscribe to Archive-It? 

Archive-It is designed to fit the needs of many types of organizations and individuals. The 95+ partners include: state 
archives, university libraries, federal institutions, state libraries, non government non profits, museums, historians, and 
independent researchers. 

Who decides which content to archive in Archive-It? 

Subscribers develop their own collections and have complete control over which content to archive within those 
collections. 

Where is the data stored for Archive-It collections? 

All data created using the Archive-It service is hosted and stored by the Internet Archive. We store two copies online 
and are working with partners to have redundant copies in other locations at the Bibliotheca Alexandrina in Egypt and 
other locations in the U.S. Subscribers can also request a copy of their data for local use and preservation either on a 
hard drive or over the internet. 
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FREQUENTLY ASKED QUESTIONS 
About the affidavit 
About the Wayback Machine 
About the affidavit 

Do I really need an affidavit from the Internet Archive? 

No. Please consider alternatives to an affidavit from the Internet Archive. Judicial notice and 
stipulation to a document's authenticity are two typical and straightforward options that 
might be used instead of an affidavit. Since our resources are limited, we urge you to 
pursue these alternatives before coming to us with authentication requests. 

What does your standard affidavit look like? 

You can see our Model Affidavit . 
Can the affidavit be notarized? 

Notarizing the affidavit is a strain on the Internet Archive's resources, since there are no 
notaries nearby. If you would like your affidavit notarized, please add $100 to payment, and 
note it in your request. 



The Internet Archive would prefer if you didn't, and will most likely fight it. The Internet 
Archive is a small non-profit, and taking a member of the team for even a few days 
significantly effects what the Archive is trying to accomplish. Please consider alternatives to 
subpoenaing someone from the Internet Archive, including using the standard affidavit or 
judicial notice. 

My request is urgent! Can the Internet Archive provide the documents and affidavit 
immediately? 

No. Unfortunately, given the number of information requests the Internet Archive receives, it 
is not feasible for us to provide anyone with expedited responses. 

However, we recommend that you provide us with a FedEx account number for sending 
your documents and affidavit, since this will speed up your wait time significantly (otherwise, 
we will send your affidavit via regular mail). Please see our Information Request Policy for 
more details. 

Can the affidavit be faxed or sent in some kind of electronic form (ex: on CD or as a 
pdf)? 

The Internet Archive does not have the resources to prepare or deliver your documents in 
any other than printed form. 

Can the Internet Archive change its standard affidavit to fit my needs? 

The Internet Archive may be willing to change its standard affidavit according to particular 
needs, on a case by case basis. However, if we agree to make such changes, you will be 
required to reimburse the Internet Archive for its related attorney fees as we ask our 
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attorneys to review and negotiate any changes to the standard affidavit. If you wish to inquire 
further about this possibility, please contact us via email at info(S)archive.org 

Does the Internet Archive's affidavit mean that the printout was actually the page 
posted on the Web at the recorded time? 

The Internet Archive's affidavit only affirms that the printed document is a true and correct 
copy of our records. It remains your burden to convince the finder of fact what pages were up 
when. 

Can the Internet Archive provide all pages from a specified domain? 

The Internet Archive cannot respond to requests that list one URL and ask for all pages at 
that domain. You must provide us with an extended URL (i.e., the full URL that appears in 
the Address field of your browser) for each page you need authenticated. The extended url 
must come from the Internet Archive's Wayback Machine and not the live web. For example, 
www.archive.org is not an extended url, but 

http://web.archive.org/web/20010812000355/www.archive.org/movies/index.html is. Please 
see our Information Request Policy for more details. 

Does the Internet Archive limit the number of documents I can request at one time? 

The Internet Archive will respond to reasonable requests for documents. If you request a 
substantial number of documents, the Internet Archive may contact you and ask that you 
reduce your request, and the turnaround time on your request may be longer than five 
business days. Please remember that every request puts a strain on the Internet Archive's 
limited resources and small staff, and therefore request only those documents which you 
believe are absolutely necessary to your case. In addition, the Internet Archive reserves the 
right to decline any request it deems to be unreasonable. 

Does the Internet Archive guarantee a turnaround time for responses to requests? 

No. The Internet Archive strives to respond to requests within five business days of receipt of 
payment, but that timeframe is not guaranteed. 

When I send my payment, how will you know that my payment relates to my request? 

If you are sending a check, please also include a copy of your request in the envelope with 
your check as well as an email address where we can contact you. If you are sending 
payment via PayPal, please email infoO.archive.orq immediately after sending your payment 
notifying the Internet Archive that you have just sent payment and identifying your request 
sufficiently for the Internet Archive to understand to which request you are referring. 

Where do I send questions about your information request policy? 

Questions should be sent via email to info@archive.org . 

I submitted an incorrect request, can I have a refund or a credit? 

No. The Internet Archive does not have the resources to set up an accounting system in this 
way. Please double-check your requests for duplicate URLs or errors before submitting 
them. We cannot refund your money or give you a credit. 

Will the Internet Archive take a position in my legal dispute? 

The Internet Archive strives to be a disinterested third party in all disputes involving its 
collection items. If you are using Wayback Machine documents to make a case in your legal 
dispute, the Internet Archive will not take an idealogical or other position in said dispute. 

1 need an affidavit for a case taking place outside of the United States. 

Internet Archive can provide you with authenticated documents and an affidavit in 
accordance with our U.S. policy with the following adjustments: 
-If you cannot provide the archive with an account number to which shipment of the 
documents can be charged, the archive will charge and additional $50-$100 depending on 
the size of your request. 

-Internet Archive will strive to have your documents printed in 5 business days after payment 
is received, however transit time to you is not guaranteed. 

-Internet Archive will accept international wire transfers for international cases only at no 
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expense to the archive; any unexpected wire transfer fees must be paid by you. 

Please remember that the Internet Archive's affidavit only affirms that the printed document 
is a true and correct copy of our records. It remains your burden to convince the finder of fact 
what pages were up when. Additionally, the Internet Archive does not automatically notarize 
affidavits; this is an additional $100 charge. 



About the Wayback Machine 



How can I tell when the pages from the Wayback Machine were archived? 

The Internet Archive assigns a URL to each archived page on its site in the format 
http://web.archive.org/web/rYear in yyyy][Month in mm][Day in dd][Time code in hh:mm:ss]/ 
[Archived URL]. Thus, the Internet Archive URL 

http://web.archive.Org/web/19970126045828 /http://www.archive.org/ would be the URL for 
the record of the Internet Archive home page ( http://www.archive.ora/ ) archived on January 
26, 1997, at 4:58 a.m. and 28 seconds (1997/01/26 at 04:58:28). Typically, a printout from a 
Web browser will show the URL in the footer. 



If a website is designed with "frames," the date assigned by the Internet Archive applies to 
the frameset as a whole, and not the individual pages within each frame. 

Are all the pages associated with a site archived on the same date? 

Probably not. Some users get confused about the temporal browsing that the Wayback 
Machine allows. If a user enters a URL into the Wayback Machine and clicks on a date, that 
date is only for that page. If a user then clicks on a link on an archived page to continue 
browsing, the Wayback Machine will grab the closest date to the one originally requested a 
display it. If the requested page has not been archived, but still available on the live web, the 
Wayback Machine will grab the live page and it will be displayed with today's date in the date 
code. 



For example, a user starts on this page: 

http://web.archive.Org/web/20000619182857/http://www.archive.org / 
which is the June 19th 2000 version ofarchive.org 

Then the user clicks on the "Internet Archive Colloquium 2000 a Success" link: 

http://web.archive.org/web/20000706194131/www.archive.org/news/index.html 

note that the date for this page is July 6th, 2000 

How can I tell what date a particular image was archived? 

The date assigned by the Internet Archive applies to the HTML file but not to image files 
linked therein. Thus images that appear on the printed page may not have been archived on 
the same date as the HTM L file. If you would like to find out when a particular image was 
archived right click (control click from Mac users) and select "open image in new tab [or 
window]". You can also select "copy image location", open up a new tab or browser window 
and paste in the url. Once the image opens look at the url in your browser's address window 
to determine the date the image was captured. Please note that using Microsoft's Internet 
Explorer's "properties" option can be misleading as it displays the same date code as the 
url's HTML file when looking at an image. Its best to open the image in its own window to 
determine the exact capture date. 

I clicked on an archived link and ended up on the live web. What happened? 

The Wayback Machine does its best to allow temporal browsing. If a user enters a URL into 
the Wayback Machine and clicks on a date, that date is only for that page. If a user then 
clicks on a link on an archived page to continue browsing, and the link is not available, 
sometimes the Wayback Machine will redirect to the live web. If we receive a request 
containing files that are not archived we cannot process them nor return payment. Keep an 
eye on your address bar to make sure the file you request is an archived file. 

I've surfed to 

http://web.archive.orq/web/20000706194131/www.archive.o r g/news/index.html . how 
do I find out what dates are available? 
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Replace the date code with a *, and the Wayback Machine will display all dates for that URL: 
http://web.archive.org/webr/www.archive.org/news/index.html 
What do these error messages mean? 
a. Robots.txt Query Exclusion. 

This means that the site is blocked by the siteowner. The Internet Archive was not contacted 
and has no record of when the exclusion took effect. 



b. Blocked Site Error. 



This means that the Internet Archive was contacted by the siteowner and asked to remove 
the site from the Wayback Machine. Pursuant to its document retention policy, the Internet 
Archive keeps the original request for 1 month. 

c. Failed Connection Error. 

This means that the machine that this URL resides on in the database is down and needs 
attention from a system administrator. If your request contains pages with a Failed 
Connection Error, the Internet Archive will need additional time to correct the problem. 

d. Not in Archive. 



This means that this URL is not in the archive, and also no longer available on the live web. 
e. Path Index Error 



A path Index error message refers to a problem in our data base wherein the information 
requested is not available (generally because of a machine or software issue, however each 
case is different). We will look into each instance where this error occurs in your request, 
however the Internet Archive cannot guarantee that all of these errors can be fixed within any 
given timeline. 

What is a frames site, and how do I go about authenticating archived frames pages? 

General information on frames sites can be found on the web. Typically, frames sites have a 
side menu bar that doesn't reload when you click on a button, only the content in the middle 
does. You can also "view page source" on your browser and check the source html for the 
words "frame" or "frameset". 



Since the time stamp on a frames site refers to the parent frame only, it would be in your 
best interest to also request authentication of the child pages. The date on these child pages 
can be found by right clicking anywhere in the child frame and opening it in a new window by 
itself. 

Can the Internet Archive search for pages on the Wayback Machine using particular 
keywords or other search terms? 

No. The Wayback Machine is not like a typical search engine in that it cannot search for 
specific terms or keywords. Therefore, the Internet Archive cannot respond to requests such 
as: "All records containing the term 'Prelinger Archives'" or "All records related to the Web 
site www.archive.org." Instead, provide a list of the extended URLs for each page on the 
Wayback Machine that you want us to authenticate. 

Does the fact that a particular URL is not in the Wayback Machine on a particular date 
mean the page did not exist on that date? 

The fact that a URL from a particular date is not accessible via the Wayback Machine only 
means that the page is not archived in the Wayback Machine. It does not mean that the page 
did or did not exist on that date. The Wayback Machine does not contain copies of every 
page that ever existed on the Internet. 
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DISCLAIMER: The information displayed here is current as of NOV 06, 2009 and is updated weekly. It is 
not a complete or certified record of the Corporation. 



Corporation 

INTEGRATED INSURANCE TECHNOLOGIES CORPORATION 

Number: C243 1410 ||Date Filed: 1 1/15/2002 ~|| Status: surrender 

Jurisdiction: DELAWARE 

Address 

SELECT QUOTE 

595 MARKET ST10TH FL 

SAN FRANCISCO, CA 94105 

Agent for Service of Process 

ROBERT EDWARDS 

SELECT QUOTE 

595 MARKET ST 10TH FL 

SAN FRANCISCO, CA 94105 



Blank fields indicate the information is not contained in the computer file. 

If the status of the corporation is "Surrender", the agent for service of process is automatically revoked. 
Please refer to California Corporations Code Section 21 14 for information relating to service upon 
corporations that have surrendered. 
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